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MODIFICATION OF VIRUS TROPISM AND HOST RANGE 
5 BY VIRAL GENOME SHUFFLING 

CROSS-REFERENCES TO RELATED APPLICATIONS 

. This appUcation is a continuation-in-part of 08/962497 filed 10r31- 

The present application claims benefit of the'08/962, 197 application, which is 
^ -incorporated herein by reference in its entirety for all purposes. , 

10 STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER 

FEDERALLY SPONSORED RESEARCH AND DEVELOPMENT 

this invention was partially made with federal support, NIST-ATP grant # 

97-0 1 -0240. The government may have some rights in the present invention. 

FIELD OF THE INVENTION 

15 The invention relates to methods and compositions for forced evolution of 

! . ■ ' ^ ' ' ' ' 

a virus genome, such as a genome of an HIV- 1 virus strain; to produce a variant virus 
having an altei-ed phenotype that provides a desired property that may be advantageous 
for development of small animal models of viral diseases, and for the development of 
novel therapeutic approaches to viral diseases, .among others (e.g.; evolving a virus to 
20 rephcate in an advantageous tissue culture system). The invention relates to novel viral 
genomes and virions which are capable of replication in non-human animals and cells, 
and further relates to transgenic non-human animals and cell lines capable of supporting 
replication of such evolved virus variants. The invention also relates to methods' for ; 
identifying novel antiviral agents. " 

25 BACKGROUND OF THE INVENTION 

HIV-I ANE) AIDS ' \ . 

Human immunodeficiency virus type I (HIV- 1 ) is a human retrovirus that 

is believed to be an etiologic agent of aqquired inunune deficiency syndrome (AIDS), an 

infectious disease characterized by a profound loss of immune system fimction. An 

30 aspect ofHW-1 disease is the typically delayed onset of disease symptoms, such as 

opportunistic infections, Kap osi^s sarcoma, dementia, and wastin g syndrome. Often it 

may take 1 0 to 1 5 years after initial infection before symptoms are evident; however, in 
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some instances disease onset is quite rapid. Moreover, the specific patiiology of HIV-1 
disease can be quite variable between individuals and between strains of the BIV-l virus 
. "(for a review, see Field's Virology, Third Edition, Fields et,al. Eds., Lippincott-Raven 
■ Publishers, Vol. 2, Chapters 60 and 61). At present, HIV-1 appears to be almost always 
pathogenic in humans, and although certain chemotherapeutic agents (e.g., proteas^ ' 

inhibitors, nucleoside analogs) have shov\'n clinical promise in arresting or slowing HIV-1 
disease, there is no estabUshed cure or preventative for HIV-1 disease at present. 
. ' Unfortunately, the HIV- 1 virus is also characterized by an extraordinainly 

high frequency of mutatiohal change, including deletions, base 

insertions, and recombinations between HIV- 1 genomes. It has been estimated that on 
, the average at least one quarter of the progeny virus from a single cycle of retrovirus 

'replication will have some kirid'ofmutation relative to the parent genome, and 
recombination will further recoifibine these variant genomes (Temin KM (1989) Genome 
■ 31': 1 7). This'charactenstic of HIV-l (and other lentiviruses, such as retroviruses) makes 
5- it difficult to obtain therapeutic solutions which the virus .cannot escape due ,to its 

■ inherently high rate of mutation and propensity' to generate variants which are resistant to . 
: . the partieulai^therapeutic solution selected. ;tor this Veason,:the currently used therapeutic 
method to treat HIV-1 disease is to combine a cocktail of multiple chemotherapeutic 
agents (e.i, protease inhibitors and nucleoside analogs)to make .it less likely th^ 
10 resistant variant can arise during therapy. Nonetheless, it is almost certain that resistant 
■HW-1 variants will arise, particularly in view 0 

chemotherapeutic regimens,pharmacogenetic differences between individual 
bioavailability of the chemotherapy agents, and use of partially degraded or inaccurately 
dispensed chemotherapy agents in less^advanced nations, 

,5 The glpbaUy circulating strains oif HIV-1 exhM^^^^ 

(Robertson et al. (1995) Nature 374: 124). To evaluate the extent of global HlV-1 
variation, sequences of virus strains originating from numerous countries have been 
compared. These studies have shown that HIV-l can be classified into two major groups, 
designated M and O, which are defined as distinct clusters on phylogenetic trees. Groups 

30 M comprises the great majority of HIV-1 isolates and can be further subdivided into at 
least nine sequence subtypes or clades, designated A to I, with additional variants being 
added to the classification scheme continually (Gaoetal. (1996) J. Virol. 70:1651). 
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Given this degree of diversity, it is, widely believed that a vaccine based on a single strain 
or subtype of HTV-l will be unsuccessful against the larger spectrum of globally 
circulating HR^-l variants, as well as against new variants which continually arise. 
Furthermore, the HIV-1 virus appears to undergo sequence variation and functional 
mutation in patients; isolates from different phases of HIV- 1 infection exhibit stage- 
specific replication characteristics (Asjo et al. (1986) Lancet 2: 660; Cheng-Meyer et al 
(1988) Science 240: 80; Fenyo et al. (1988) J. Virol. .62: 4414; Tersmette (1989) J. 

Virol. 63:2118). 

In view of the propensity of HIV- 1 to undergo rapid mutation and 
generate variants that are resistant to chemotheraputic agents and candidate "universal" 
vaccines, it is desirable to have non-human animal models of HIV- 1 replication and 
disease in order to speed the identification and development of new generations of 
antiviral agents that can be used to treat resistant HIV- 1 variants, or to prevent the 
generation of such variants in vivo. Unfortunately, such non-human models of HIV- 1 
disease are presently lacking. 

NON-HUMAN MODELS OF HIV-1 DISEASE 

^ The absence of a suitable animal model has remained one of the major 

barriers to the development of an effective therapy for HIV-1 infection. Ideally, a readily 

available small animal model that could sustain HIV-1 infection and develop clinical 

symptoms that reflect the disease in humans wodd prove useful for modeling * 

pathogenesis and developing new antiviral agents. An animal model that could duplicate 

human immune responses would greatly facilitate the development of vaccines. 

- Unfortunately, no current model fulfills these varied needs (for review see, Klotman et al. 

(1995) AIDS 9: 313; Chang etal. (1996)Transfus. Sci. 17: 89; and Bonyhadi ML and 

KaneshimaH(June, 1997) Molec. Med. Today pp; 246-253; Mosier DE (Sept.,1996) 

Hosp! Prac. Pp. 41-60). 

In general, non-human animals are not susceptible to infection with HIV- 1 
(Morrow et al (1987) J. Gen. Virol. 68: 2253). However, several animal models exist in 
which to study retroviruses related to HIV-1 and their related pathology; these include 
SIV in macaque monkeys, FIV in cats, and murine acquired immunodeficiency syndrome 
virus (MAIDS) in mice, among others. HIV-1 replicates weakly in chimpanzees, but 
^lii^^noTdetectablelhsea's^^ . 
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suited for large-scale Studies, Lewis AD arid Johnson PR (1995) TIBTECH 13: 142 ' " ^ 

discuss various non-human animal model systems and their limitations. 

Several Hiy-2 isolates, including three molecular clones : 
(Hiy-2ROD,HIV-2SBL-ISY;andffly t 
macaques (M. mulatta and Ivll nemestrina) or baboons (Franchini, et al. (1989) Proc. Natl. 
Acad, Sci.'.U.S.A. 86, 2433- 2437; Bamett, et al. (1993) Journal of Virology. 67, 1006-14; . 
Boeri, etal/(1992) Journal of Virology66, 4546-50; Castro,^^ ^ 
219-26; Tranchini, etaL (1990) Journal of Virology 

Aids 4, 78^-9; Putkonen, et al. (1991) Nature 352, 436-8). " . ^ 

10 As alternatives to the above, models of HIV- I path^ " . - ' 

; ' experimentally derived in mice that are transgenic for portions of the HIV genome or an 
\ ' . entire HIV- 1 genome, as well as in SCID mice which have been reconstituted with HIV- 

■ infected irnmune cells (Ramezani et al. (1996) Transm^ ' - 

': ' ^' (1996)«Gp.cit). HIV transgenic mice have been developed to model the in vivo regulation 
15 and pathological consequences of expression of various HIV open reading frames . - 
(OREs), including known HIV structural genes; a 
" have been obtained from expression in various tissues of such animals 
' • expression in T cells ofHIV transgenic mice has been neglig^^^ 

iimitationofthese miceasHIVdisease models.. Furthennore, amajo^ , 
20. moilse system is the inability of HIV to irife^ 
, ; transduced with the gene for human CD4, the major receptor protein, for HIV- 1 

■ infectivity. ■ . ■■ ^ ■ ■ ■ ' \ ' , . • - , " ' . ' ^ 

■ * ^ scro/hu mice have been reported 35 candidate aaimai 

Hiy-1 (Mosier DE (1996) op.c^^^ 
' 25. :i5{)5^BonyhadiML:andKanes^ 

and B ceils, and have been successfully engrafted with hum^ he^^^ 
(e.g., the SCDDAiu mouse having engrafted human thymus and liver tissue, peripheral 
blood leukocytes (PBLs), or hematppoietic precursor cells (Kamel-Reidetal. (1988) 
Science 242: 1706; McCune et al. (1988) Science 241 : 1632; Mosier et al. Nature 335: 
30 256). Such xenochimeric SCID/hu or SCID/hu-PBL mice have been used to study HTV 
pathogenesis in vivo and to evaluate anti-HIV drugs (Mosier et al. (1991) Science 25 1 : 
791; Mosier et al. (1995) Science 260: 689; McCune et al. (1990) Science 247; 564; 
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Ruprecht et al.^ (1992) AIDS Res. Hum. Retroviruses 8: 997). However, these SCID 
mice models produced certain results which were anomalous, such as when infected with 
non-cytopathic macrophage-tropic (in humans) HIV isolates the mice underwent a rapid 
depletion of CD4+ cells, but when infected with cytopathic, T.cell-tropic HIV isolates the 
5 CD4+ cells were not depleted, the exact opposite of what occurs in the human. 

Thus, the art continues to search for improved models of HIV disease 
using small animal models and different (i.e., non-HIV) viruses. The absence of a 
suitable animal model has remained one of the major barriers to the development of an 
effective therapy for HIV- 1 infection. It is apparent from the foregoing^that a need exists 

10 in the art for an improved model of HIV-1 infection to .further the development of anti- 
HIV therapies and prophylactic agents. 

Significant improvements to and new opportunities for anti-HW therapies 
and antiviral screening methods could be realized if better models of HIV- 1 replication 
and pathogenesis were available. The present invention meets these and other needs and 

15 provides such improvements and opportunities. 

The references discussed herein are provided solely for their disclosure 
prior to the filing date of the present application. Nothing herein is to be construed as an 

- ' admission that the inventors are not entitled to antedate such disclosure by virtue of prior 
invention. All publications cited are incorporated herein by reference, whether 

20 specifically noted as such or not. 

SUMMARY OF THE INVENTION 

The present invention relates to methods for generating viral genotypes 

encoding at least one modified viral tropic phenotype, such as infectivity, virulence, and 
pathogenesis in a cell type, tissue, or host animal species (commonly host range; defined 
25 herein as a subset of viral tropism). The tropic phenotype modification can either permit 
or restrict, viral infection, replication, and/or cytopatic effect in a predetermined cell type 
and/or host species (e.g., a non-human mammal). A basic format of the method, termed 
viral genome shuffling, in broad application, consists of: (1) contacting a cell strain, cell 
Ime, or non-human animal (or explanted organ therefi-om), which does not naturally 
30 support substantial repUcation of an predetermined virus, with at least one initial 
infectious virion or replicable genome of said predetermined virus imder replication 
conditionsr(2) recovering-a plurality of replicated-genome copies of said predeterniined 
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Vims, either as virions or as viral genomes in polynucleotide form, wherein some or all of' 
the rq)licated genome copies comprise a mutation relative to the initial infectious virion 
or replicable genome, (3>recombining a plurahty of said replicated. ^^^^^ 
to shuffle the mutations,.thereby generating a collection of recombined replicated genome; 
5 copies, and (4) selecting or screening said collection of recombined replicated genome 
copies to obtain one or more replicable viral ge 
tropic phehotype. It is 6ftendesirable,toperform,at least one adi^^^^^ 
whereby the coilection of recombined replicated genonie copies is contacted w^ 

^ - strain; cell line; or nbn^humah aiiimal (or explanted organ therefrom) under replication - 
0 condiLnstoproduceseeond(orsubsequent)roundreplicatedcopieshayingad 
. mutations, and to recover and shuffle, by recombination, said second (or subsequent) 
round replicated genome copies prior to thestep of selecting. or screening for genomes. ' ■ 
. V encoding a modified viral tropic phenotype. Typically, the recombination in step (3) is 

■ performed in vitro:.or by an in vivo recombination method which substantially does not ; 
15. occur naturally during replication bf said viral genome. -In certain variations, naturally 

. - occuiTing in vivo recombination mechanisms can be used in conjunction with a collection 

■ of preselected virus variants having aliesired phenotypic property to be optimized further; 
\ in tMs way, a natural viral recombination mechanism can 

• , selection ofvariants in aaiterative manner to produce p^^^^^ • 

20 evolution", wherein the forced evolved virus variants ar^ not expected to, nor are 

: observed to, occur in nature, nor are predicted to occur at an. appreciable frequency. The 
: ' practitioner may fiirther elect'to supplemented/or the mutational drift by introducing 
mutatedviralgenomes,orportionsthereof,intothepoolofinitialinfectiousvirions(or 

replicable genomes) and/brinto the pWrality genome copies which are 

21 recombined. Mutaiiorial drift also be ^ 

-chemical mutagens or mutagenic irradiation), or by employing replication conditions 
which enhance the mutation rate of the vims. 

, • • . The invention also provides for the viral genomes and infectious virions, 
produced by the method of viral genome shuffling; the exact stmctures of said produced 
30 viral genomes and infectious virions are definable a priori only by reference to the 
method by which they are generated. Thus, the invention includes a viral genome, or 
plurality thereof, produced by the methods described herein. The shuffled viral 
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genome(s) produced thereby are easily distinguishable from naturally occurring viral 
genomes by virtue of their atypical modified viral tropic phenotype(s) which is/are 
normally not present in the population of naturally occurring viral genomes. 

In a variation of the basic method, one or more portions of the yiral ■ 
5 genome are separately optimized or improved for function in the predetermined cell type 
and/or host species as distinct genetic elements isolated from the remainder of the viral 
genome. The optimized or improved portions of the viral genome are then either 
introduced into the initial viral genome(s) for use in the method, or are shuffled in by 
recombination with the replicated genome copies recovered after a round of replication in 
10 the host cell or host animal. In a variation, the optimized or improved portions of the 
viral genome can be used in conjunction with one or more heterologous polynucleotide 
' sequence(s), such as non- viral genes or replicons to confer a desired functional or 
structural property, such as transcriptional regulation or translational regulation, to the 
heterologous sequence(s). Optimized or improved portions of a virus genome often can 
15 be marketed as a commercial product, either alone or in combination with one or more 
heterologous sequences. 

' The invention also encompasses compositions of such shuffled viral . 
genomes encoding at least one modified viral tropic phenotype. The compositions can 
include a plurality of species of shuffled viral genomes, or can represent a single purified 
20 viral genome species. Certain shuffled yiral genomes encode variant viruses which 

possess detectable phenotypes that are not naturally occurring and which can be selected 
for; selected phenotypes often are characterized by desirable properties, such as modified. ' 
host range as compared to wildtype virus, modified cell tropism as compared to wiidtype 
virus, and modified immunogenicity, among other desirable properties. 
25 The invention also encompasses screening assays and kits comprising a 

composition of such shuffled viral genome(s) and a cell type, tissue, or host animal 
species for which said shuffled viral genome(s) encode a modified viral tropism or drug 
resistance phenotype. In an aspect, the screening assay or kit further comprises a test 
agent, which is typically a small organic molecule such as a nucleoside analog or protease 
30 inhibitor with a molecular weight of less than 3 ,000 Daltons. In an aspect, the cell type or 
host animal is transgenic and expresses at least one human protein which confers, either 
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alone or in conjunction with one or more additional human protein species, susceptibility 
of a cell to infection by and/or replication of said predetermined virus. 

The invehtion provides screening assay .methods, for identifying and 

quantitating pharmacological properties of antiviral compounds. An exemplary screening 
5 assay format to identify agents that modify replication of a virus, said method comprises 
the steps of: (1) contacting, iinder suitable conditions, a test agent with a screening 
composition comprising: (i) one or more variants of said virus evolved by viral genome 
shuffling so as to repHcate in cells or in anon^human animal, which cells or non-human 
^ animal d6es not naturally support replication of said virus, and (ii) said cells or non- 
0 human animal, and (2) determining whether said test agent modifies replication of said 
variant(S) in said cells or non-human animal. In an embodiment, the test agent inhibits 
' :virus replication; although it should be possible to screen for test agents which modify 

. other aspects of viral replication, such as replication potentiators, immune modulators (for 
' use with non-human' animai' systems), and agents which modify the virus genetic 
5 ; expression program (e,g, late gene inhibitors, latency modifiers, and the like)". 

Although the methods of the inven-ion are beheved to be suitable for use- 

with substantially my virus type, including plant viruses, bacterial viruses, and animal 
• viruses, it is described with particular reference to HIV-.l for illustrative purposes. It is 
believed that, with regard to animal viruses, the method will find particular use in 
[0 developing shuffled virus genomes pf pathogenic or oncogenic viruses for which present^ 
day noh-human animal models are insufficient or lacking entirely. fflV-i and the related 
HIV disease is only, one example of such suitable viruses and their pathologies. 

■ ■ With reference to HIV-1, the invention provides a method for producing, 
by viral genome shuffling, at least one HIV-1 variant which is capable of substantial 

^5 replication in a npn-humari cell type. |n an enAodim^nt, the invention provides a m^^ 

for generating one or more HlV-lgenome(s) which encode(s) the phenotype of \ 
permissive replication in mouse cells that express at least one human protein which 
confers, either alone or in conjunction with one or more additional human protein species, 
susceptibility of a mouse cell to infection by andyor replication of HIV-1 . Examples of 
30 such human susceptibility proteins for HIV-1 infection include, but are not limited to 

hCD4, hCCR5. hCXCR4, and other accessory proteins identified in the art. Often, non- 
human primate homologs of these proteins can be substituted. In an aspect, the method 
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employs a transgenic non-human cell or animal containing at least one expreission cassette 
which encodes and expresses at least one human HIV- 1 susceptibility protein. The viral 
genome shuffling method using these transgenic cells and/or animals as replication media 
produces shuffled HIV variants which have improved tropism for infection and/or 
5 replication of the transgenic non-human cells or animals. The shuffled HIV variants may 
be backcrossed (e.g., by recombination) to one or more HIV isolate(s), with concomitant 
selection for retention of the property of improved, tropism for the transgenic cells or 
animals, thereby retaining the minimal mutations necessary for the desired tropic 
phenotype while "nativizing" the remainder of the viral genome to conform with the 

10 chosen HIV isolate(s). By the use of backcrossing, it is believed possible to generate, by 
use of the method of the invention, HIV variants substantially corresponding to 
essentially any HIV clinical isolate or sequence-related category thereof (e.g., group, 
clade, etc.), wherein the variants possess a desired phenotypic property not naturally 
associated with HIV; an example of such a phenotypic property can be the capacity for 

1 5 substantial replication in non-human cells and non-human organisms, such as for example 
mouse cells and transgenic mice. 

In an aspect, the methods of the invention can be used to modify the 
inmiunogenic properties of a virus (i.e., the phenotype being selected for is an 
immunological property). For example, a virus (or collection of virus species) can be 

20 evolved to evade a host organism immune system, such as a human or mouse immune 
system. Also for example, a virus (or collection of virus species) can be evolved so as to 
mimic one or more immunologic stages of virus evolution in vivo; e.g., the viral 
dynamics of HIV-l infection "of a human patient is chi*acterized by a cohtiriual natural 
evolution of certain immunodominant viral epitopes so as to naturally evade the human 

25 immune system - the present invention can be used to generate HTV- l variants which ■ 
mimic one or more later immunological stages of HIV infection; such variants may serve 
/as candidate HTV-l vaccines, among other uses. 

In an aspect^ the methods of the invention can be used to modify the 
metabolic properties of a virus (i.e., the phenotype being selected for is a resistance to one 

30 or more chemotherapeutic agent). For example, a virus (or collection of virus species) 
can be evolved to rapidly model the natural development of drug resistance to anti-HIV 

HnigR_The present invention c an be used to g enerate HIV-l variants which are drug 
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'reristfflit; such-yarifflits.can.-be used in-screcning.assays^to identify alternative ^ ; 
chemotherapeutic agents to which the HIV variants are not cross-resistant, among other 

' uses. ' ■ ■ ' " 

The disclosed method for altering a viral phenotype by it 

shuffling and phenotype selection is a pioneering method which enables a brpad range of 

novel and advantageous viral and non-.viral compositions, therapeutic and prophylactic 

methods and compositions, and apparatus which will be apparent to those skilled in the 

art in view of the present disclosure. , 

Other features and advantages of the invention will be apparent from the 

following description of the drawings, preferred embodiments of the invention, the 

examples, and the claims. ■ - ^ 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 . Block-diagram oifan embodiment of the basic method for viral 

genome shuffling and selection for a desired phenotype. Solid boxes and arrows: indicate ■ 
principal steps, outlined boxes and dotted arrows indicate altemative.or optional steps or 

iterations. , '• 

Figure 2, Exemplary schematic of recombinatorial shuffling o"f a 

■ . collection of viral genomes having a variety of mutations or distinct genome portions. 
Sequence variations are indicated with an (X) and a novel open reading frame.(ORF) • 
present in one viral isolate but absent in the others is identified as a box. 

Figure 3.' Schematic portrayal of virus evolution by viral genome shuffling 
to produce variants capable of substantial replication in a non-human animal, such as a 

mouse. V '■ 

Figure 4. (A) Schematic portrayal' of how chimeric oUgonucleotide 

primers can be used to generate.a SHIV genome by PCR-b^ed shuffling of a mixture of • 
an HIV genome and a SIV genome using the chimeric primers^ An ''H" indicates a 
portion complementary to an HIV sequence and a "S" indicates a portion complementary 
to a SrV sequence. (B) Structural diagram of SIV/HIV chimeric genome for shuffling 
and propagation in monkey cells. 

Figure 5 . Diagram showing an example of a low crossover library of 
sequence-recombined polynucleotides resulting firom coarse granularity shuffling, as 
compared to a high crossover hbrary resuliing from a fine granularity shuffling. Each 
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unique color (shade) represents nucleotide sequence from a distinct sequence variant 
(e.g., from a mutated parental sequence, from a plurality of viral isolates or clades, etc.). 

Figure 6. Schematic Diagram for Construction of shuffled library. 

Figure 7. Schematic diagram of Passaging of a Shuffled Library to select 
5 for CHO-Tropic virus. 

Figure 8. Structure of recombinant CHO-tropic envelope showing 
contributions from three parents. - • 

Figure 9. HIV- 1 Evolution Decision Tree. • 

DEFINITIONS 

10 Unless defined otherwise, all technical and scientific terms used herein 

have the same meaning'as commonly understood by one of ordinary skill in the art to 
which this invention belongs. Although any methods and materials similar or equivalent' 
to those described herein can be used in the practice or testing of the present invention, 
the preferred methods and materials are described. For purposes of the present invention, 
15 the following terms are defined below. 

The term "reassembly" is used when recombination occurs between 
identical polynucleotide sequences. 

By contrast, the term "shuffling" is used herein to indicate recombination 
between substantially homologous but non-identical polynucleotide sequences, in some 
20 embodiments DNA shuffling may involve crossover via nonhomologous recombination, 
such as via cre/lox and/or flp/frt systems and the like, such that recombination need not 
require substantially homologous polynucleotide sequences. Homologous ^d non- 
homologous recombination formats can be used, and, in some embodiments, can generate 
molecular chimeras and/or molecular hybrids of substantially dissimilar sequences. Viral 
.25 recombination systems, such as template-switching and the like can also be used to 
generate molecular chimeras and recombined viral genomes, or portions thereof. 

The term "related polynucleotides" means that regions or areas of the 
polynucleotides are identical and regions or areas of the polynucleotides are heterologous. 
The term "chimeric polynucleotide" means that the polynucleotide 
30 comprises regions which are wild-type and regions which are mutated. It may also mean 
that the polynucleotide comprises wild-type regions from one polynucleotide and wild- 
~~~type regiohs'fromMdth'er rielated pol^^ 
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" • The tem "cleaving" means digesting the polynucleotide with enzymes or 
breaking the polynucleotide (e.g., by chemical or physical means), or .generating partial 
length copies of a parent sequence(s) via partial PGR extension, PGR stuttering, 
differential fragment amplification, or other means of producing partial length copies of 
one or more parental sequences. 

, The term "population" as used herein means a coUectiori of components 

such as polynucleotides, nucleic acid fragments or p^^^^^^ A "mixed population:' means 
a collection of components which belong to the same family of nucld^ 
(i^e. are related) but which differ in their sequence (i.e. are not identical) and hence in 

their biological activity. 

• The term "mutations" means changes in the sequence of a parent nucleic 

■ -acid sequence (e.g., a gene or a.viral genome) or changes in the sequence of aparent 

' polypeptide. Such mutations may be point mutations such as transitions or transversions. . 
The mutations may be deletions, insertions or duplications. ' 
5 . ' The term "recursiye sequence recombinati9n" as used herein refers to a 

' ; method whereby a jiopulation of polynucleotide sequences are recombined with each 
other by any suitable recombinalion meaSs (e.g., sexuaFPCR, homoiogous ; ' " ; 

recombination, site-specificTecombination, etc.) to generate a library of sequ^ 

. recombined species which is then screened or subjected to selection to obtain those . 
10 ' sequence-recombined species having a desired property;' the selected species are then . 
subjected to at least one, additional cycle of recombmation with meinseives and/or with . 

■ other polynucleotide'species and at subsequemseleciicm^ . 

. property. 1 ' , ' • . . 

The term "amplification" means that, the nuniber of copies of a nucleic acid 

25 fragment is increased. • ■. ; .„.!, . ■- 

The terni "naturally-occurring" as used herein as applied to an objects refers 

to the fact that an object can be found in namre. For example, a polypeptide or 
polynucleotide sequence that is-present in an organism (including viruses) that can be 
isolated from a source in nature (e.g., including from human patients) and which has not 
30 been intentionally modified by man in the laboratory is naturally-occurring. As used 
herein, laboratory strains of rodents which may have been selectively bred according to 
classical genetics are considered naturally-occurring animals. As used herein, naturally- 
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occurring viruses are those viruses, including naturalvariants thereof, which can be found ' i ' 
in a source in nature, including virally infected individuals. 

As used herein "predetermined" means that the cell type,- non-huinan, / 
animal, or virus may be selected at the discretion of the practitioner on the basi s of a 
'5 known phenotype. 

As used'herein, "linked" means in polynucleotide linkage (i.e., • 
phosphodiester linkage). "Unlinked" means not linked to another polynucleotide, . 
sequence; hence, two sequences are unlinked if each sequence has a free 5* terminus and a 

free 3' terminus. .< • -* - * 

10 • As used herein, the term "operably linked" refers to a Hnkage of 

polynucleotide elements in a fimctional relationship/ A nucleic aci^ . ' n) 

when it.is placed into a functional relationship with another nucleic acid sequence. . For 
instance, a promoter or enhancer is operably linked to a coding sequence if it affects the 
transcription ofthe coding sequence.' Operably linked means that the DN A sequences 
15 being' linked are typically contiguous and, where necessary to join two protein coding 4 ■ ' 

regions, contiguous and in reading frame. However, since enhancers generally function , |i ' - 

when separated from the promoter by several kilob^es and intronic sequences nlay be of 
variable lengths, some polynucleotide elements may be'bperably linke • v|,l' 

contiguous. A structural gene'(e.g., a HSV tk gene) which is operably linked to a t 
2Q polynucleotide sequence corresponding to a transcriptional regulatory sequence of an . ,'^|/: , 

' endogenous gene is generally expressed in substantially the sanie temporal and cell t)^^^ , , , ' 

specific pattern as is the'naturally-occurring gene. 

As used herein, the tenns "expression cassette"! refers to a poiynucleotid^^^ 
comprising a promoter sequence and, optionally, an enhancer and/or silencer element(s), ^ 
25 operably linked to a structural sequence, such as a cDNA sequence or genomic DNA 
sequence. In some embodiments, an expression cassette may also include 
polyadenylation site sequences to ensure polyadenylation of transcripts. When an 
expression cassette is transferred into a suitable host cell, the structural sequence is 
transcribed from the expression cassette promoter, ^d a translatable message is . ^ 
30 - generated, either directly or following appropriate RNA splicing. Typical^^^ 

expression cassette comprises: (1) a promoter, such as an SV40 early region promoter, , ' 
— -HSVtk promoter or-phosphoglycerate-kinase(pgk)promoter,-or^other-s — — — - 
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known in the art. (2) a cloned polynucleotide sequence, such as a cDNA or genom'C 
iagment ligated to the promoter in sense orientation so that transcription from the 
. promoter wiU produce a RNA.that encodes a functional protein, and.(3) a polyadenylation 
sequence. For example and not limitation, an expression cassette of the invention may, 
5 comprise the cDNA expression cloning vectors. pCD and ^MT (Okayama H and Berg P 
■(1983)Mol. Cell. Biol. 3:280;GkayamaHandBergP(1985)M61. Cen^^^ Biol. 5: 
. 11 36, incorporated herein by reference). 

■ .The tenn "transcriptional-modulatioh" is used 
capacity to either enhance branscription or inhibit trans^^ 
id iinkedincis;suchenhancementorinhibitionmaybecontingentontheoccurrenceofa 

specific event, such as stimulation with an inducer and/or may only be manifest in certain 
cell types. The altered abiUty to modulate transcriptional enhancement or inhibition may ■ , 
^^affect the inducible transcription of a gene or may effect the basal level transcription of a 
'■ gene, or both. Numerous other specific examples. of transcription regulatory elements, 
15 , such'as . specific enhancers and silencers, are known to those of skill in the art and may be , 
. .•selected for use in the methods and polynucleotide constructs of the invention on the^basis 
; ' of thVpractitioner's desired application. Literature sources and published patent • ■ 
. . documents; as well as GenBank and.other'sequence'informationdata sources can be^ ' . , 
' consultea by those ofskill in the art in selecting suitable tr^^ 
. 20 foruseintheinvention. Wherehecessary,atranscriptionre^^^^^^ . 
constructed by synthesis (and ligation, if necessary) 

ofavailable sequence information (e.g.,.GenBank sequences for a CD4enhanc^^^^^ , 

■ SV40 early promoter): ' 

■ . : ^ ^ As used herein, the term "transcriptional unit" pr "transcriptional complex" 
: 25 , refers to a" polynucleotide sequence that comprises ^ structural gene (exons), a cis-acting ■, 

. linked.promoter and other cis-acting sequences necessary for e^^^^^ 
structural sequences, distal regulatory elements necessary for appropriate tissue-specific 
and developmental transcription of the structural sequences, and additional cis sequences 
important for efficient transcription and translation (e.g., polyadenylation site, mRN^ 

30 stability controlling sequences). 

As used herein, the term "transcription regulatory region" refers to a DNA 
sequence comprising a fimctional promoter and any associated transcription elements 
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(e.g., enhancer, CCAAT box, TATA box, SPl site, etc.) that are essential for transcription 
of a polynucleotide sequence that is operably linked to the transcription regulatory region. 

As used herein, the term "xenogeneic"%is defined in relation to a recipient 
viral genome, mammalian host cell, or nonhuman animal and means that an amino acid 
sequence or polynucleotide sequence is not encoded by or present in, respectively, the 
naturally-occurring genome of the recipient viral genome, mammalian host cell, or 
nonhuman animal. Xenogenic DNA sequences are foreign DNA sequences; for example, 
human APP genes or immunoglobulin genes are xenogenic with respect to murine ES 
cells; also, for illustration, an HS V tk gene is xenogenic with respect to an HIV-i 
genome. Further, a nucleic acid sequence that has been substantially mutated (e.g., by 
site directed mutagenesis) is xenogenic with respect to the genome from which the 
sequence was originally derived, if the mutated sequence does not naturally occur in the 
genome. 

As used herein, the term'"minigene" or "minilocus" refers to a 
heterologous gene construct wherein one or more nonessential segments of a gene are 
deleted with respect to the naturally-occurring gene, Typically, deleted segments are 
intronic sequences of at least about 100 basepairs to several kilobases, and may span up to 
several tens of kilobases or more. Isolation and manipulation of large (i.e., greater than 
about 50 kilobases) targeting constructs is frequently difficult and may reduce the 
efficiency of transferring the targeting construct into a host cell. Thus, it is frequently . 
desirable to reduce the size of a targeting construct by deleting one or more nonessemtial 
portions of the gene. Typically, intronic sequences that do not encompass essential 
regulatory elements may be deleted. Frequently, if convenient restriction sites bound a 
nonessential intronic sequence of a cloned gene sequence, a deletion of the -intronic 
sequence may be produced by : (1) digesting the cloned DNA with the appropriate 
restriction enzymes. (2) separating the restriction fragments (e.g., by electrophoresis), (3) 
isolating the restriction fragments encompassing the essential exons arid regulatory 
elements, and (4) ligating the isolated restriction fragments to form a minigene wherein 
the exons are in the same linear order as is present in the germline copy of the naturally- 
occurring gene. Alternate methods for producing a minigene will be apparent to those of 
skill in the art (e.g., ligation of partial genomic clones which encompass essential exons 
x_butJwhichlack-portions_ofintroiiic_sequence)._MosL^^ 
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comprising a minigene will be arranged in the same, linear, order as is present in the 
germline gene, however, this will not always be the case. . Some desired regulatory. , 
elements (e.g;, enhancers, silencers) may be relatively position-insensitive, so that the : 
regulatory element will function correctly eyen if positioned differently in a minigene ^ 
than in the corresponding gemline gene. For example, an enhancer may be located at a 
different distance from a promoter, in a different orientation, and/or in a different linear, 
order. For example, an enhancer that is located '3' to a promoter in germline configuration 
• might be located 5' to the promoter in a minigene. Similarly, some genes may have exons 
which are alternatively spliced at the RNA level, and thus a minigene may.have fewer 
exons and/or exons in a different linear order than the corresponding. germline gene and 
still encode a furictionargene prbduet. ; A cDNA encoding a gene product may also be . 
used to construct a minigene. However, since it is generally desirable that the - 
heterologous'm'inigene be expressed similarly to the cognate naturally-occurring 
nonhuman: gene, transcription of a cDNA minigene typically is driven by a Unked gene 
promoter, and enhancer from the naturally-occurring gene. , 

■ As used herein, the terms "endogenous DNA sequence" and "endogenous 

. target sequence" refer to pol>Tiucre6tide.sep^ cell. Such ^ 

sequences include, for example, chromosomal sequences (e.g., structural genes,. 
promoters,;ehliancers, recombinatorial hotspots, repeat sequences, integrated proviral- 
sequences); apisomal sequences (.e,g . . ; : 

intermediates),chloroplast and mitochondrial DN^ ; . 

polynucleotide is a polynucleotide which is transferred into a eukaryotic cell but which 
has not been replicated iii that host cell;, for example, a virus genome polynucleotide that 
enters a cell by fusioni of a virion to thexell :is m exogenous polynucleotide, however, 
replicated copies of the viral polynucleotide subsequently made in tiie infected cell are 
endogenous sequences (and may, for example, become integrated into a cell 
chromosome): Similarly, transgenes which are transfected into a cell are exogenous 
polynucleotides, however integrated and replicated copies of the transgene(s) are 
endogenous sequences. 

The term "corresponds to" is used herein to mean that a polynucleotide 
sequence is homologous (i.e., identical) to all or a portion of a reference polynucleotide 
sequence, or that a polypeptide sequence is identical to a reference polypeptide sequence 
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In contradistinction/ the term "complementaiy to'' is used h^^ 

complementary sequence is homologous to all or a portion of a reference polynucleotide 
sequence. For illustration, the nucleotide sequence "5*-TATAG" corresponds to a 
reference sequence "5*-TATAC" and is complementary to a reference sequence "5'- 
5- GTATA". ' ; • 

The following terms are used to describe the sequence relationships . . 
between two or more polynucleotides: "reference sequence", "comparison window", 
"sequenceidentity", "percentage ofsequence identity", and "substantial identity". A , 
"reference sequence" is a defined sequence used as a .basis for a sequence comparison; a 
10 reference sequence may be a subset of a larger sequence, for example, as a segment of a 
full-length viral gene or virus genome. Generally, a reference sequence is at least 20 
nucleotides in length, frequently at least 25 nucleotides in length, and often at least 50 
nucleotides in length. Since two polynucleotides may each comprise (1) a sequence (i.e., 
a portion of the complete polynucleotide sequence) that is similar between the two 
15 polynucleotides, and (2) a sequence that is divergent between the two polynucleotides, 
sequence comparisons between two (or more), polynucleotides are typically performed by 
comparing sequences of the two polynucleotides over a "comparison window" to.identify 
and compare local regions of sequence similarity. 

A "comparison window", as.used herein, refers to a conceptual segment of 
20 at least 25 contiguous nucleotide positions wherein a polynucleotide sequence may be 
compared to a reference sequence of at least 25 contiguous nucleotides and wherein the 
portion of the polynucleotide sequence in the comparison window may comprise 
additions or deletions (i^e., gaps) of -20 percent or less as compared to' the reference 
sequence (which for comparative purposes in this manner, does not comprise additions or 
25 deletions) for optimal alignment of the two sequences. Optimal aligiiment of sequences 
for aligning a comparison window may be conducted by the local homology algorithm of 
Smith and-Waterman (1981) Adv. Appl. Math. 2: 482, by the. homology alignment 
algorithm of Needleman and Wunsch (1970) J. Mol. Biol. 48; 443, by the search for 
similarity method ofPearson and Lipman (1988) Proc. Natl. Acad. Sci. (U.S.A.) 85: 
30 2444, by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, 
and TFASTA in the Wisconsin Genetics Software Package Release 7.0, Genetics 
Computer Group, 575 Science Dr., Madison, WI), or by inspection, and the best 
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alignment (i.e., resulting in the highest percentage 
window) generated by the various methods is selected. . 

The term "sequence identity" means that two polynucleotide sequences are 
identical (i.e., on a nucleotide-by-nucleotide basis) over the window of comparison. The 
• term "percentage of sequence identity" is calculated by comparing two optimally aligned ■ 
sequences over the window of comparison, detemiining the number of positions at which 
the identical nucleic acid base (e.g., A. T, C, G„U, or I) occurs in both sequences to yield 
the number of matched positions, dividing the number of matched positions by the total 

number of positions in iho window of comparison (i.e. the window size), and multiplying 
the result by 100 to yield the percentage of sequence identity. The term "substantial 
identity!' as used herein denotes a characteristic of a polynucleotide sequence, wherein the 

■ polynucleotide comprises a sequence that has at least 80 percent sequence identity, 
preferably at least 85 percent identity and often 89 to 95 percent sequence identity, more , 
usually at least 99 percent sequence identity as compared to a reference sequence over a 
comparison window of at least,20 nucleotide posittons,. optionally over a window of at- 
least 30-50 nucleotides; wherein the p.rcentage of sequence identity is calculated, by 
comparing the reference sequence, to the polynucleotide sequence that may include,: 
deletions or additions which total 20 percent or less of the reference sequence over the 
window of comparisori: The reference sequence may be a subset of a larger sequence, for 
example, as a segment of the full4ength viral gene or vi 

'' : Specific hybridization is defmed herein as the.formation, by.h 

bonding or nucleotide (or nucleobase) bases, of hybrids between a probe polynucleotide 
(e g a polynucleotide of the invention and a specific target polynucleotide (e.g., a viral 

, gene' or virus genome), wherein the probe preferentially hybridizes to the specific target 

^ such that, for example, a single band corresponding to, e.g.. one or more of the. RNA 
species of the ^ene (or specifically cleaved or processed RNA species) can be ide^^^^ 

or. a Northern blot of RNA prepared firom a suitable source (e.g., a virus-infected somatic 
cell expressing the viral gene). Such hybrids may be completely or only partially base- 
paired. Polynucleotides of the invention which specifically hybridize to viral genome 
0 sequencesmaybepreparedonthebasisofthesequencedataprovidedhereinand 

available in the patent applications incorporated herein and scientific and patent 
publications noted above, and according to methods and thermodynamic principles 
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known in the art and described in Sambrook et al. et ah, Molecular Cloning: A 
Laboratory Manual, 2nd Ed:, (1989), Cold Spring Harbor, NY; Berger and Kimmel, 
Methods in Enz}TOology, Volume 152, Guide to Molecular Cloning Techniques (1987), 
Academic Press, Inc., San Diego, CA; Goodspeed et al. (1989) Gene 76: 1; Dunn et al 
5 (1989) J. Biol. Chem. 264: 13057, and Dunn et al.' (1988) j. Biol. Chem. 263: 10878, 
which are each incorporated herein by reference. 

As used herein the term "replication conditions" refer to aqueous 
conditions wherein a virus or virus genome is capable of undergoing at leaist one principal 
step of viral replication, wherein the principal step can include: attachment of virion to 

10 host cell, entry of viral genome into host cell, uncoating of virus, polynucleotide 

replication (RNA transcription (plus or minus strand), reverse transcription, DNA- \ 
templated DNA polymerization, viral gene expression, encapsidation, budding, and the . 
like. In general, conditions which result in a rephcation phenotype (see, infra) are 
rephcation conditions. Often, suitable replication conditions can be physiological 

15 conditions. "Physiological conditions" as used herein refers to temperature, pH, ionic 
- strength, viscosity, and like biochemical parameters that are compatible with a viable 
organism., and/or that typically exist intracellularly in a viable cultured mamjnalian cell, 
particularly conditions existing in the nucleus of said mammalian cell. For example, the 
intranuclear or cytoplasmic conditions in a mammalian cell grown under typical 

20 laboratory culture conditions are physiological conditions. Suitable in vitro reaction 

conditions for in vitro transcription cocktails are generally physiological conditions, and 
may be exemplified by a variety of art-known nuclear extracts. In general, in vitro 

" physiological conditions can comprise 50-200 mM NaClof KCl, pH 6.5-8:5, 20-45°C and 
0.001-10 mM divalent cation (e.g., Mg-H-, Ca-H-); preferably about 150 mM NaCl or 

25 KCl, pH 7.2-7.6, 5 mM divalent cation, and often include 0.01-1.0 percent nonspecific 
• protein (e.g., BSA). A non-ionic detergent (Tween, NP-40, Triton X-100) can often be 
present, usually at about 0.001 to 2%, typically 0.05-0.2% (v/v). Particular aqueous 
conditions may be selected by the practitioner according to conventional methods. For 
general guidance, the following buffered aqueous conditions may be applicable: 10-250 

30 mM NaCl, 5-50 mM Tris HCl, pH 5-8, with optional addition of divalent cation(s), metal 
chelators, nonionic detergents, membrane firactions, antifoam agents, and/or scintillants. 
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Ao used herein, the term "replication pheiiotype" refers to one or more of 
the following phenotypic characteristics of cells infected with a virus: (l)substo^^^ 
expression of late geAe products, such as capsid proteins (e.g., adenoviral pentpn base 
polypeptide) or RNA transcripts initiated ftom viral late gene promoter(s), (2) replication 
of viral genomes or formation of replicative intermediates, (3) assembly of viral capsids 
or packaged virion particles, (4) appearance of cytopathic effect (CPE) in the infected 
cell, (5) completion of a viral lytic cyde, and (6) other phenotypic alterations which are 
typically contingent upon substantial replication of a yirus in cells infected with a wild- 
type repHcation competent vii^s encoding functional viral p^^ 

phenotype comprises at least one of the listed phenotypic characteristics, preferably more 

than one of the phenotypic characteristics. 

• As used herein, the term Replication deficient vinis" refers to a virus that 

supports expression of a virus replication phenotype, and which is substantially unable to 
independently complete the replication cycle to produce infectious virions in the absence 
of helper virus or helper functions acting in trans. T>pically, a replication deficient virus 
exhibits a substantial decrease in plaquing efficiency on cells conventional^ 

plaque titer aparent virus.'. : - - ; , . - ■ - - - - 

■ As used herein, the terms "label" or. "labeled" refer to incorporation of a 
detectable marker.' elg., a radiolabeled amino acid or a recoverable labeKe.g. biotinyl 
moieties that can be recovered by avidin or streptavidin). Recoverable labels can include 
covalently linked polynucleobase sequences th^ 

complementary sequence polynucleotide. Various methods of labeling polypeptides. , 
PNAs. and polynucleotides are known in the art and may be used. Examples of labels 
include, but are not limited to. the following: radioisotopes (e.g.. 3H, 14C. 3.5S. 1251. 
131I)Jfluorescent or phosphorescent.labels te.g., FITC. rhodami^ 
phosphors). Enzymatic labels (e.g:, horseradish peroxidase, P^galactosidase, luciferaie,^ 
alkaline phosphatase), biotinyl groups, predetermined polypeptide epitopes recognized by 
a secondary reporter (e.g.. leucine zipper pair sequences. binding sites for antibodies, 
transcriptional activator polypeptide, metal binding domains, epitope tags). In some 
embodiments, labels are attached by spacer arms of various lengths, e.g., to reduce 
potential steric hindrance. 
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As used herein, the term "statistically significant" means a result (i.e., an 
assay readout) that generally is at least two standard deviations above or below the mean 
of at least three separate determinations of a control assay readout and/or that is 
statistically significant as determined by Student's t-test or other mt-accepted measure of 
5 statistical significance. 

The term "transcriptional modulation" is used herein to refer to the 
capacity to either enhance transcription or inhibit transcription of a structural sequence 
linked in cis; such enhancement or inhibition may be contingent on the occurrence of a 
specific event, such as stimulation with an inducer and/or may only be manifest in certain 
10 cell types. 

The term "agent" is used herein to denote a chemical compound, a mixture 
of.chemical compounds, a biological, macromolecule, or an extract made fi-om biological 
materials such as bacteria, plants, fungi, or animal (j)anicularly mammalian) cells or 
tissues. Agents are evaluated for, potential activity as antiviral agents by inclusion in 
1 5 screening assays described herein below. 

The term "candidate agent" is used herein to refer to an agent which is 
identified by one or more screening niethod(s) of the invention as a putative antiviral 
agent. Spme candidate antiviral agents may have therapeutic potential as drugs for 
human use. 

20 . As used herein, "substantially pure" means an object species is the , 

predominant species present (i.e., on a molar basis it is more abundant than any other 
individual macromolecular species in the composition), and preferably a substantially 
purified fraction is a composition wherein the object species comprises at least about 50 
percent (on a molar, basis) of all macromolecular species present. Generally, a 
■ 25 substantially pure composition will comprise more than about 80 to 90 percent of all 

macromolecular species present in the composition. Most preferably, the object species is 
purified to essential homogeneity (contaminant species cannot be detected in the 
composition by conventional detection methods) wherein the composition consists 
essentially of a single macromolecular species. Solvent species, small molecules (<500 
30 Daltohs), and elemental ion species are not considered macromolecular species. 

As used herein, the term "optimized" is used to mean substantially 
- . improved-in a desired structure.oriimctionjalatiyejp^m^ 
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necessarily the optimal structure or function which could be obtained if all possible 
combinatorial variants could be made and evaluated, a condition which is typically ■ 
impractical due to the number of possible combinations and permutations in 
. polynucleotide sequences of significant length (e.g., a complete viral gene or genome). 

5 DETAILED DISCUSSION OF THE INVENTION 

The invention relates in part to a generally ap^^ 

generating hovel variant viruses having genotypes and phenotypiss which do not naturally 

. occur or would be anticipated to occur at a substantial frequency. A broad aspect of the 

method employs recursive nucleotide sequence recombiriatiGn, term ■ 

1 0 shuffling", which enables the rapid generation of a collection of broadly diverse viral 

phenotypes that can be selectively bred for a broader range of novel phenotypes or more 

extreme phenotypes than would otherwise occur by natural evolution in the same time 

period. A basic variation of the method is a recursive process comprising: (1) sequence 

shuffling of a plurality of viral genomes; (2) selection of the resultant shuffled viral 

1 5 genomes to isolate or enrich a plurality of shuffled viral genpmes having a desired 

phenotype(s), and (3) repeating steps (1 ) and (2) on the pluraUty of shuffled viral 

• genomes having the desired phenotype(s) until one or more variant viral genomes having 

: a sufficiently optimized desired phenotype is obtained. In this general maimer, the 

method facilitates the "forced evolution" of a viral genome to encode a desired phenotype 

20 which natural selection and evolution has heretofore not generated. Fig. I shows a block 

dia^am of a basic method for viral genome shuffling^ a^^^^ 

: phenotype; the recursion option is generally selected each cycle until one or more viral 

genonies having a satisfactory optimization for the desired phenotype(s) are obtained. 

. Typically ; a plurality of viral- genomes of ^ 

25 xlassification.are shuffled and selected by jhe preset 

common use of the method will be to shuffle mutant variants of a clinical isolate(s) or of 

a laboratory strain of a virus to obtain a variant of the clinical isolate or laboratory strain 

that possesses a novel desired phenotype. However, the method can be used with a 

plurality of strains (or clades) of a vims, or even with a plurality or related viruses (e.g., 

30 lentiviruses, herpesviruses, adenoviruses, etc.), and in some instances with unrelated 

. viruses or portions thereof which have recombinogenic portions (either naturally or 



wo 99/23107 PCT/US98/23107 

. ■ ' 23 

generated via genetic esngineering): As long as two sequences have a region of sequence ^ 
similarity, they can generally be combined. . ^ ' : 

The method can be used to shuffle xenogeneic viral sequences into a viral 
genome (e.g., incorporating and evolving a gene of a first virus in the genome of a second 
5 virus so as to confer a desired phenotype to the evolved genome of the second virus). . 
Furthermore, the method can be used to evolve a heterologous sequence (e.g., a non- ■ 
naturally occurring mutant viral gene) to optimize its phenotypic expression (e.g., r 
function) in a viral genonie, and/or in a particular host cell or expression systehi (e.g., an 
expression cassette or expression replicon). Fig. 2 shows axi example schematic 
10 representation of recombinatorial shuffling of a collection of viral genomes having a 
variety of mutations or distinct genome portions; positions of mutarions are indicated by 
an (X), and distinct genome segments (e.g., obtained from the genomes of different virus 
isolates) are indicated by a open box. ' 

/ .In an aspect of the invention, the phenotype(s) which are selected for are 

15 the tropismand/or host range of the virus. Tropism is often defined as the cell type ' 
which can be productively infected by a virus (e.g., CD4-f- T cells for HIV-1, 
nasopharyngeal epithelium for rhinovirus, etc;), and host raiige is conomonly defined as 
the species of organism in which the virus can replicate (e.g., humans, simians,, mice, rats, 
etc.). Both tropism and host range are believed to be restricted by the specific type(s) of 

20 proteins expressed by a cell; a cell lacking expression of a necessary protein that acts as a 
viral receptor may fail to'support infection by the virus, similarly a virus may have . , 
evolved to use a host cell protein (e.g., polymerase) in one species (e.g., human) but not 
in another species (e.g., mouse). The present method can be used to create variant 
viruses which exhibit altered tropism or host range by employing the rapid forced , 

25 evolution of shuffling to generate variant viruses that are adapted to the desired tropism or 
host range.' As an example of this, HIV-1, which does not naturally replicate.in mouse 
cells, can be evolved to do so by the present method. Similarly, it is believed that HIV- 1 , 
which normally does not infect human fibroblasts, can be evoly ed to; do so by the present 
method. The method is general and can be employed to modify tropism and/or host 

30 range of substantially any virus suitable for recursive sequence shuffling (e.g., viruses 
that can be rescued as infectious virions following sequence shuffling). Fig. 3. shows a 

schematic portrayal bf-vinis4ropisni/hostTange.evolution.byjvdral^ 
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produce recombinant variants capable of substantial replication in a non-human animal or 
cell line, such ais a mouse. 

' ■ ' The present method can also be employed to force evolution of host ceUs 
and transgenic animals tp support infection and/or replication of a virus which normaUy 
does not infect or replicate in said cells or transgenic animd. For example, a transgenic 
mouse (or mouse cell line) expressing a transgene encoding a human chemokine receptor 
protein (e.i, CCR5, CXCR4) and/or a human CP4 protein can be improve 
supporting infection by HIV-1 by selecting those transgenic mice (or transgenic mouse 
cells) which exhibit at least bne, preferably more, positive phenotypic characteristics of 
HIV-1 infection (e.g., attachment of HTV-1, yiral entry, replication, expression of a 
reporter gene encoded by HIV-l, production of infectious HIV-1 virions, etc.), and 

shuffling: the transgenes (or portions thereof)' froin' the selected mice (or cells) which 
encode the chemokine receptor proteih(s) and/or human CD4. By recursive shuffling 
and selection, it is believed possible to force the evolution of transgene-encoded proteins 
which permit infection by HIV-l. HIV-l variant viruses may be allowed to co-evolve 
with the transgenic mice (or transgenic mouse cells), or, the transgenic mice (or cells) can 
be optimized to support naturally-occurring HiV-i isolates. 

. : ' Although described with specificity with respect to HIV- 1 , the present 
;;method can be used with substantially any type of virus having a genome suitable for in 
' vitro or in vivo sequence shuffling, including but not limUed to hepatitis C virus (HCV), 
adenovirus, adeno-associated virus (AAV), lentiviius, hepatitis B virus, HIV-2, murine 
leukeiiiia virus (MLV) among others. Preferably, the viruses used in the method of the 
invention are suitable for cloning of an intact genome (or a substantial portion thereof), 
such that the genomic sequences can .be shufiled by a recombination method of the 
invention, and the recombined^iral genome^^^^^ 

through the use of a helper'eell line, helper virus, or in vitro fecbnstitution of replication 
intermediates (see, for example, U.S. Patent 5.166,057 and WO97/12032). Such 
genome recovery and rescue systems are known in the art for many types of viruses, and 
can be appUed by the practitioner to the virus type of interest. Thus, both viruses having 
either an RNA genome or DNA genome are suitable for use in the method. Intact 
genomes (or portions thereof) can be recovered fiom virions, as repUcative intermediates 
from host cells, as cDNA copies, or as integrated provirus from a host cell genome (e.g.. 
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as a cosmid clone or lambda clone). The recovered viral genome sequences can br 
shuffled with other viral genome sequences and/or with one or more spiked 
polynucleotide specie(s)'(e.g., mutation-bearing gene sequences or mutation-bearing 
intergenic viral genome sequences), which may include optimized components of a viral 
5 genome that have been separately optimized by shuffling (e.g., a Tat gene sequence or a 
tar sequence of HIV- 1 which has been optimized for function in mouse cells). Optimized 
components typically can include expression cassettes encoding viral genes, viral 
transcriptional regulatory sequences, origins or replication, non-coding sequences 
important for replication (e.g., panhandle sequences of influenza virus genome segments), 

10 LTRs, repeat sequences, and the like. For viruses with segmented genomes, individual 
segments may be optimized separately by recursive sequence shuffling and selection, or a 
combination or all of the segments may be optimized collectively for a desired 
pheriotype; it is also possible to combine one or more cycle(s) of individual 
component/segment evolution with one or more cycle(s) of collective 

15 component/segment evolution, in any order. 

In an aspect of the invention, a plurality of replication defective viral 
genomes are shuffled and the resultant shuffled genomes are selected for the capacity to 
replicate in a desired cell type or host organism. 

In an aspect of the invention, complementing genome portions of or 

20 complete genomes of two or more distinct virus types (e.g., HIV-1 and SFV) are shuffled 
and phenotype selected to generate and isolate one.or more shuffled variant virus 
genomes that have a desired phenotype (e.g., the capacity to repUcate in simian cells but 
' ' retain a substantial portion of the HIV- 1 genome). "The resultant shufflants comprising a 
portion of an HIV-1 (or HIV-2) genome and a portion of an SIV genome, and having 

25 functional sequences sufficient to support replication in a host cell are termed "SHTV 
recombinants". Kuwata etal. (1996) AIDS 10: 1331 report chimeric viruses between 
SrV and various HIV-l isolates that have biological properties similar to those of parental 
HIV-1 . Unlike the present invention, the chimeras made by Kuwata et al, are simple 
recombinants of discrete genome portions of SIV and HIV-1, and are not the product of 

30 recursive sequence shuffling and selection for a desired phenotype. 
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RECOMBINATION METHODS 

The present invention provides methods, reagents, geneti^^^ 

animals and cells, and pharmaceutical compositions relating to the forced evolution of 

viral genomes, or portions thereof, haying a desired phenotypic alteration generated by 

5 polynucleotide sequience shuffling of a plurality of viral genomes, typically of the same : . 

virus type (e.g., HIY4,.HCy,;adenovinis,.etcO.^ . / , , 

. Generally, the nomenclature used hereafter and the labbr^^ ' 

in cell culture, molecular genetics, virology, and nucleic acid chemistry and hybridization 

described below are those well known and commonly employed in^to^ 

10 ' techniques are used for recombinant nucleic acid methods, polynucleotide synthesis, and 
microbial culture and transformation (e,g., electropor alien, lipofection). Generally, 
enzymatic reactions and purification steps are performed according to the manufacturer's 
: specifications. The techniques and procedure's ai-e generally performed according to 
conventional methods in the art and various general references. 

15 Forexample, the practice of this invention involves the construction of ; 

recombinant nucleic acids, the expression of genes in transfected host cells "and molecular 
library construction. Molecular-cloning techniques to achieve these ends are 
the art. A wide variety of cloning and in vitro amplification methods suitable for the 
construction of recombinant nucleic acids such as expression vectors are well-known to 

20 persons of skill; General texts which describe mblecular biological techniques useful 

herein, including mutagenesis, include Berger and Kimmel; Guide to Molecular Cloning 
Techniq ues, Methods in Enzymolpgy volume 152. Academic Press, Inc.j San Diego, CA 
(Berger): Sambrook et al.; Molecular Cloning - A Laboratorv Manual (2nd Ed.\ Vol. 1- 
3, Cold Spring Harbor Laboratory, Cold Spring Harbor,. New York, 1989 ("Sambrook") 

25 and Cun-ent Protocols in Molecular Biology, F.M, Ausubel et al., eds., Cxirrent Protocols, 
a joint venture between' Greeiie Piibli^^ Inc.^md John Wiley &^^^ Inc., 

(supplemented through 1998) ("Ausubel")). 

Examples of techiuques . sufficient to direct persons of skill through in vitro 
amplification methods, including the polymerase chain reaction (PGR) the ligase chain 

30 reaction (LCR), Q -replicase amplification and other RNA polymerase mediated 

techniques (e.g., NASBA) are found in Berger, Sambrook, and Ausubel, as well as MuUis 
et al., (1987) U.S. Patent No. 4.683.202: Technology: Principles and Applications for 
DNA Amplification ed. HA Erlich, Freeman Press, New York, NY (1992); Mattila et al. 
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r 1 99 n Nucleic Acids Res . 19: 4967; Eckert, K.A. and Kunkel, T.A. (1991) PGR 
Methods and Applications 1:17; PGR, eds. McPherson, Quirkes, and Taylor, ERL Press, 
Oxford; and U.S. Patent 4,683;202, which are incorporated herein by reference). . PGR 
Protocols A Guide to Methods and Applications (Innis et al. eds) Academic Press Inc. 
5 San Diego, GA (1990) (Innis); Amheim&Levinson (October 1, 1990) G&EN 36-47; 
The Journal Of NIH Research (1991) 3, 81-94; (Kwoh et al. (1989) Proc. Natl Acad. Sci. 
USA 86, 11 73; Guatelli et al. (1990) Proc. Natl. Acad. Sci. USA 87, 1874; Lomell et al. 
(1989) J. Clin. Chem 35, 1826; Landegren et al., (1988) Science 241, 1077-1080; Van 
Brunt (1990) Biotechnology 8, 291-294; Wu and Wallace, (1989) Gene 4, 560; Barringer 

10 etal. (1990) Gene 89, 117. and Sooknanan and Maleka995) Biotechnology 13: 563-564. 
Improved methods of cloning in vitro amplified nucleic acids are described in Wallace et 
al., U.S. Pat. No. 5,426,039. Improved methods of amplifying large nucleic acids by PCR 
are summarized in Cheng et al. (1994) Nature 369: 684-685 and the references therein, in 
which PGR amplicons of up to 40kb are generated. One of skill will appreciate that 

15 essentially any RNA can be converted into a double stranded DNA suitable for restriction 
digestion, PGR expansion and sequencing using reverse transcriptase and a polymerase. 
• See, Ausubel, Sam.brook and Berger, all supra. 

OUgonucleotides for use as probes, e.g., in in vitro amplification methods, 
for use as gene probes, or as shuffling targets (e.g., synthetic genes or gene segments) are 

20 typically synthesized chemically according to the solid phase phosphoramidite triester 
method described by Beaucage and Garuthers (1981), Tetrahedron Letts .. 22(20):1859- 
1862, e.g., using an automated synthesizer, as described in Needham-VanDevanter et al. 
(1984) Nucleic Acids Res., 12:6159-6168: Oligonucleotides can also be custom made 
and ordered firom a variety of commercial sources known to persons of skill. 

25 Indeed, essentially any nucleic acid with a known sequence can be custom 

ordered firom any of a variety of conmiercial sources, such as The Midland Certified 
Reagent Gompany (mcrc@oligos.com), The Great American Gene Gompany 
(http://www.genco.com), ExpressGen Inc. (www.expressgen.com), Operon Technoloigies 
Inc. (Alameda, GA) and many others. Similarly, peptides and antibodies can be custom 

30 ordered fi-om any of a variety of sources, such as PeptidoGenic (pkim@ccnet.com), HTI 
Bio-products, inc. (http://www.htibio.com), BMA Biomedicals Ltd (U.K.), Bio.Synthesis, 

Inc., and many others. 
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Chimeric targeted mice are derived according to' Hpgan, et al.. 
Manipulating the Mouse Embryo: A Laboratory Manual Cold Spring Harbor Laboratory 
(1988) and Teratocafcinomas and Embryonic Stem Cells: ' A Practical Approach, E.J. 
Robertson, ed., IRL Press, Washingtph, D.C., (1987) .which are.incorporated herein by 

5 ■' reference.'. ■ ■■ • ' 

- . Embryonic stem cells are manipulated according to published procedures 

(Teratocarcinomas and Embryonic Stem Pells: A Practical Approach, E.J. Robertson, 
■ 1 ; ed:, IRL Press, Washington, D.C. (1987); Zjilstra et al:. Nature 342:435-438 (1989); and 

Schwartzberg et al.. Science 246:799-803 (1989), each of which is incorporated herein by 
.10 reference). . ' 

' ■ ' AH sequences' referred to herein by: GenBank database file designation 

, '(e g:, GenBank: Humatct4a) of a conimonly used reference name which is indexed in 
V. GenBank'are incorporated herein by reference and are publicly available. , 

• ■ ■ DNA. Shuffling " ' . 

' 15 ' '. " . Exemplary fonnats and examples for sequence recombination, referred to, 

e.g., as "DNA shuffling," "fast forced evolution," or "iholecular breeding," have been 
described by the present inventors and co-workers' in the following patents and patent 
applications: US Patent No. 5,605,793; PCT Application WO 95/22625 (Serial No. 
, 'PCT/US95/02126), filed February 17, 1995; US Serial No. 0 

20 1995; US Serial No. 08/621 ,430, filed March 25, i996; PCT Application WO 97/20078 . 
(Serial No. PCT/US96/05480), filed April 1 8, 1996; PCT Application WO 97/35966, 
filed Marcli 20, 1997; US' Serial No. 08/675,502, filed July 3; 1996; US Serial No. 
08/72li 824, filed September 27, 1996; PCT Application WO 98/13487, filed September 
26, 1997; "Evolution of Whole CeUs and Organisms by Recursive Sequence 

25 Recombination" Attorney Docket No. 01 8097-020720US filed July 1 5, 1998 by del 

• Cardayre et'al. (USSN____); Stemmer, Science 270:1510 (1995); Stemmer a/.^ Gene 
164:49-53 (1995); Stemmer, Bio/fechnology 13:549-553 (1995); Stemmer, Proc. Natl. 
^cad. 5ci. 91:10747-10751 (1994); Stemmer, iVarwre 370:389-391 (1994); 
Crameri et al., Nature Medicine 2(l):l-3 (1996); Stemmer, U.S. Pat. 5,811,238, and 

30 Crameri et al.. Nature Biotechnology 14:315-319 (1996), each of which is incorporated 
by reference in its entirety for all purposes. 
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'■ Nucleic acid sequence shuffling is a method for recursive in Vitro or in 
vivo homologous or nonhomologous recombination of pools of nucleic acid fragments or 
polynucleotides (e.g., viral genomes or portions thereof). Mixtures of related nucleic 
acid sequences or polynucleotides are randomly or pseudorandomjy fragmented, and 
5 , reassembled to yield a library or mixed population of recombinant nucleic acid mole'cules 
or polynucleotides. . 

^ The present invention-is directed to a method for generating a selected 
polynucleotide sequence (e.g., a viral genome or viral gene) or population of selected 
polynucleotide sequences, typically in the form of amplified and/or cloned 
1 0 polynucleotides, whereby. the selected polynucleotide sequence(s) possess a desired 
phenotypic characteristic (e.g., encode a polypeptide, promote transcription of linked 
polynucleotides, bind a protein, and the like) which can be selected for, and whereby the 
selected polynucleotide sequences are viral genomes or genes having a desired 
functionality and/or conferring a desired phenotypic property to a viral genome. ' 'One 
15 ' method of identifying novel viral genome sequences that possess a desired structure or 
■ fimctional property, such as having ar^.:^Jtered tropism or host range (e.g 

capable of substantia] infection and replication of a non-human host), involves the ^ 
' screening of a large library of recombinant viral sequences (which can be a component of 
a viral genome - e.g., part of a viral gene, non-coding transcriptional regulatory sequence, 
20 originofreplication,- or a complete viral genome) for individual library members which 
possess the desired structure or functional property conferred by the novel viral genome 
sequence. ; 

— - ; In a general aspect; the invention provides~a methodrtermed *'sequm^ 

shuffling," for generating libraries of recombinant poljoiucleotides 
25 characteristic which can be selected or screened for. Libraries of recombinant 

polynucleotides are generated from a population of related-sequence polynucleotides 
which comprise sequence regions which have substantial sequence identity and can be 
homologdusly recbmbined in vitro or in vivo. In the method, at least two species of the 
related-sequence polynucleotides are combined in a recombination system suitable for 
30 generating sequence-recombined polynucleotides/ wherein said sequence-r|M^ 

polynucleotides comprise a portion of at least one first sjpecies of a related-sequence 
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polynucleotide with at least one adjacent portion of at least one second opecies of. a 
related-sequence polynucleotide. 

Recombination systems suitable for generating sequence-recombined 
' polynucleotides can be dther:(l) in yitro systems for honioto^ 
5 sequence shuffling via amplification or other formats described herein, (^^ 

systems for homologous recombination or site-specific recombination as described 
herein, or template-switching of a retroviral genome repUcation event or (3) whole 
genome approaches in which entire genomes are shuffled. The population of sequence- 
recombined polynucleotides compriseis a subpopulation of pplynucleotidss; which possess 
10 desired or advantageous characteristics and which can be selected by a suitable selection 
or screening method. The selected sequence-recombined polynucleotides, which are 
typically related-sequence polynucleotides, can then be subjected to at least one recursive 
cycle wherein at least one selected sequence-recombined polynucleotide is combined with 
at least one distinct species of related-sequence polvTiucleotide (which may itself be a 
15 selected sequence-recombined polynucleotide) in a recombination' system suitable for 
generating sequence-recombined polynucleotides, such that additional generations of . 
sequence-recombined polynucleotide sequences are generated from the selected . , 
sequence-recombined polynucleotides obtained by the selection or screening m^ 
employed. In this manner,- recursive sequence recombination generates library members 
20 which are sequence-recombined polynucleotides possessing desired characteristics. Such 
characteristics can be any property or attribute.capable of being selected for or detected in 
a screening systetn, and may include properties of: an encoded protein, a transcriptional 
element, a sequence controlling transcription, RNA processing, RNA stability, chromatin 
conformation, translation, or other expression property of a gene or. ti^^ v 
25 replicative element, a protein-binding element, or the like, such as any feature which 
confers a selectable or detectable property. r ;v : 

Screening/selection produces a subpopulation of viruses (or cells) 
expressing recombinant forms of gene(s) or virus genomes that have evolved toward 
acquisition of a desired property. These reconibinant forms can then be subjected to 
30 further rounds of recombination and screening/selection in any order. For example, a 
second round of screening/selection can be performed analogous to the first resulting in 
greater enrichment for genes having evolved toward acquisition of the desired property 
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(e.g., tiopism of a virus in a selected cell type). Optionally, the stringency of selection 
can be increased between rounds (e.g., if selecting for drug resistance, the concentration 
of drug in the media can be increased). Further rounds of recombination can also be 
perforrned by an analogous strategy to the first round, generating further recombinant 
5 forms of the gene(s) or genome(s). Alternatively, further rounds of recombination can be 
performed by any of the other molecular breeding formats discussed. Eventually, a 
recombinant form of the gene(s) or genome(s)is generated that has fully acquired the 
desired property. 

The method of shuffling can generate libraries of polynucleotides (viral 
1 0 genomes, transgene polynucleotides) encoding selectable properties, including altered 
tropism and/or host range, which can compose all or a part of a viral genome or host cell 
transgene, wherein the library is suitable for function optimization of a gene or regulatory 
sequence or phenotypic screening. The method comprises, e.g., (1) obtaining a first 
plurality of library members comprising a viral genome, viral gene, viral regulatory or 't- 
1 5 replication sequence, or host cell transgene (or encoding sequence or expression cassette '-^ 
thereof), and obtaining from said library a polynucleotide, or copy thereof, complete or 
partial, of at least one selected library member having a detectable desired phenotype, 
optionally introducing mutations into said polynucleotide or copy(ies), and (2) pooling 
and fragmenting, by nuclease digestion, partial extension PGR amplification* PGR 
20 • stuttering, or other suitable fragmenting means, typically producing random fragments or ^ 
fragment equivalents, said selected polynucleotide(s) or copies to form fragments thereof 
under conditions suitable for PGR amplification, perfonning PGR amplification and 
optionally mutagenesis, and thereby homologously recombining said fragments to form a 
shuffled pool of recombined polynucleotides, whereby a substantial fraction (e.g., greater 
25 than 10 percent) of the recombined polynucleotides of said shuffled pool are not present 
in the first plurality of selected library members, said shuffled pool composmg a library 
of shuffled selected variant viral genome sequences or transgene sequences suitable for 
functional screening or phenotype screening. Optionally, the method comprises the 
additional step of screening the library members of the shuffled pool to identify 
30 individual shuffled library members having the desired functional ability or phenotype. 
The novel shufiled viral genomes, viral genome sequences, and transgene sequences that 
are identified from such libraries can be used for model non-human systems of viral 
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replication, infection, antivira! agent screening, candidate attenuated live virus vaccines 
and therapeutics, and the like; and/of can be subjected to one or more additibnal cycles of 
shuffling and/or functional optimization or phenotype selection for further optimization. 
The method can be modified such that the step of selecting is for a phenotypic 
5: characteristic other than viral infectivity, replication, host range, tropism", gene function, 
transcriptional regulatory sequence function, or the like. 

In an embodiment, the first plurality of selected library memhiisrs is 
fragmented and homologously recombined by PCR in vitro. Fragment generation is by 
nuclease digestion, partial extension PCR amplification, PCR stuttering, or other suitable 

1 0 fragmenting means, such as described herein arid in W095/22625 published 24 August 
1995, and in commonly owned U.S'.S.R U.S.S.N. 08/621,859 filed 25 March 1996, 
PCT/US9'6/p5480 filed 1 8 April 1 996, which are incorporated herein by reference). 
Stuttering is fragrhentation by incomplete polymerase extension of templates. A 
recombination format based on very short PCR extension times can be employed to create 

15 partial PCR products, which continue to extend off a different template in the next (and 
' subsequent) cycle(s), and effect de facto fragmentation. Template-switching and other 
formats which accomplish sequence shuffling between a plurality of sequence-related 
polynucleotides can be used. Such alternative formats will be apparent to those skilled in 
^ the art. : ■ ' • ■ \ ■/ - ' ■ 

20 ' In an embodiment, the first plurality of selected library members is 

fragmented in vitro, the resultant fragments transferred into a host cell or organism and 
homologously recombined to form shuffled library members in vivo. 

In an embodiment, the first plurahty. of selected library members is cloned 
or amplified oh episomally replicable vectors, a multiplicity of said vectors is transferred 

25 into a cell and homologously recombined to form shuffled library members in vivo. 

In an embodiment, the first plurality of selected library members is not 
fragmented, but is cloned or amplified on an episomally replicable vector as a direct 
repeat or indirect (or inverted) repeat, which each repeat comprising a distinct species of 
selected library member sequence, said vector is transferred into a cell and homologously 

30 recombined by intra-vector or inter-vector recombination to form shuffled library 
members in vivo. 
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' In an embodiment, first plurality of selected library members is replicated 

under conditions wherein retroviral template switching between at least two xenogeneic 
viral genomes occurs, typically involving retroviral genomes or non-retroviral genes , 
cloned into a retroviral replication system.. , - . 

5 In an embodiment, combinations of in vitro and in vivo shuffling are 

provided to enhance combinatorial diversity. The recombination cycles (in vitro or in 
vivo) can be performed in any order desired by the practitioner. 

The present invention provides a method for generating libraries of viral 
genomes or viral genetic sequences suitable for phenotype screening, such as to generate 

10 enhanced fimction in a cell type and/or animal species, modify viral tropism or host 

range, or other desired property. The method comprises ( 1 ) obtaining a first plurality of 
library members comprising a viral genome polynucleotide or portion thereof, (2) pooling 
and fragmenting said polynucleotides or copies to form fragments, thereof under 
conditions suitable for PGR amplification and thereby homologously recombining said 

1 5 fragments to form a shuffled pool of recombined polynucleotides comprising novel 
combinations of viral sequences, whereby a substantial fraction (e.g., greater than 10 
percent) of the recombined polynucleotides of said shuffled pool comprise viral genome 
sequence combinations which are not present in the first plurality of library members, 
said shuffled pool composing a library of viral genome sequences comprising sequence . 

20 combinations suitable for phenotype screening. Optionally, the plurality of selected , 
shuffled library members can be shuffled and screened iteratively, from 1 to about 1 000 
cycles or as desired until library members having a desired binding affinity are obtained. 

Often, from"2 to 25 cycles of recursion are performed before a sufficiently optimized 

shufflant (i.e., selected shuffled library member) is obtained. The degree of optimization 

25 for any particular application will vary based on the specific intended use and other . 
considerations (e.g., time, minimization of mutational drift, etc.) that are selected.by the 
practitioner. - 

The invention also provides the use of polynucleotide shuffling to shuffle a 
population of viral genes (e.g., capsid proteins, spike glycoproteins, polymerases, 

30 proteases, etc.) or viral genomes (e.g., adenoviruses, AAV, MoMuLV, HCV, lentiviruses, 
retroviruses or any other known classification) to develop enhanced viral genomes having 
a desired phenotypic property. In an embodiment, the invention provides a method for 
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shuffling sequences encoding all or portions of immunogenic viral proteins to generate 
novel; combinations of epitopes as well as novel epitopes created by recombination to > . 
provide gene thefkpy vectors which elicit (or avoid) strong immune responses in a host. 
In an embodiment, the invention provides viral vectors which have enhanced capacity to 
> infect a desired target celi population and/or an enhanced.cell type specificity to reduce 
infection of non-target cells. In one aspect, such improved yiral vectors can.serve as. 
platforms for delivery of gene therapy sequences orvectprrencoded immunogenic vaccine 

epitopes to a host. : • . 

For viral genome sequence shufifling,(ortransgehe shuffling) by , 

0 homologous recombination (including by PCR-based homology overlaps) at lea^ , 

species of the related-sequence polynucleotides are combined in a recombination system 

: •'• ; suitable for generating sequence-recombined polynucleotides,' wherein said sequence- 

recombined polynucleotides comprise, a portion of at least one first species of a related- 
• . sequence polynucleotide with at least one adjacent portion of at least one second species 
5 of a related-sequence polynucleotide: Recombination systems suitable for generating. 
' ' ■ sequence-recombined polynucleotides can be either: (1 ) in'vitro 'systems for homologous 
recombination or sequence shuffling via amplification .or other formats described herein, 
or (2) in vivo systems for homologous recombination or site-specific recombination as 
. described herein. ,fhe population ofsequence-recombined polynucleotides comprises a 
10 . subpopulation of polynucleotides which possess desired or advantageous characteristics 
and which can be, selected by a suitable selection or screening method. The selected 
. sequence-recombined polynucleotides, which are typically related-sequence . 
■ :. polynucleotides, can then be subjected to at least one recursive cycle wherein at least one 
selected sequence-recombined polynucleotide is corabi^ 
25 species of related-sequence polynucleotide, (which may itself be a selected sequence- 
. . reeombinea polynucleotidfe) in a recombination system suitable for generating sequence- 
recombined polynucleotides, such that additional generations of sequence-recombined 
polynucleotide sequences are genefated.from tiie selected sequence-recombined 
polynucleotides obtained by tiie selection or screening method employed. In this 

30 manner, recursive sequence recombination generates library members .which are 
sequence-recombined polynucleotides possessing desired characteristics. Such 
characteristics can be any property or attribute capable of being selected for or detected 
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a screening system, and may include properties of: an encoded protein, a transcriptional 
element, a sequence controlling transcription, RNA processing, RNA stability, chromatin 
conformation, translation, or other expression property of a gene or transgene, a 
replicative element, a protein-binding element, or the like, such as any feature which 
5 confers a selectable or detectable property; A particular advantageous property is an : 
altered tropism or host range which allows a human-tropic virus to infect and replicate in 
a non-human host animal or non-human cell type, or an altered tropism which allows a 
virus to repHcate in a cell line which has desirable features (e.g., a cell line that has been 
approved by regulatory authorities, or is conveniently cultured, or the like) or altered cell 
10 tropism in a host (e.g., adenovirus that selectively infects melanoma cells and speciaUzed 
Ag-presenting cells, and Ae like). 

Forced Evolution of Models of Viral Disease 

The invention provides a means to evolve virus variants and/or host cells 
(or organisms) that are convenient non-human model systems for studying virus-induced 

15 pathology, virulence factors, attenuated live-viral vaccine candidates, and other aspects of 
viral infections, as well as providing a model system for evaluating a library of agents to 
identify candidate antiviral agents, that could find use as prophylactic and/or therapeutic 
drugs for human and veterinary applications. 

The methods of the invention can be used to force the evolution of a virus 

20 which has a host range or tropism that limits its infectivity and/or replication to hosts 
which are inconvenient to use as a model system (e.g., humans or other primates, large 
mammals, etc.). For example, a virus which has a host range restricted to humans can be 

modified by recursive sequence shuffling and selection for growth in-a non-himian host - 

(organism or cell culture) to produce shuffled variants that have significantly improved* 

25 capacity to infect and/or replicate and produce infectious virions in the non-hmnan host. 
In instances where there is no detectable infection or repHcation in a non-human host, 
shuffling of the virus of interest with a virus of a similar taxohomic type which is known 
to infect and/or replicate in the non-human host may generate a population of shuffled 
viral genomes which population contains one or more shuffled virus genomes that can 

30 replicate, at least weakly, in the non-human host. By obtaining at least one variant 
shuffled genome having some level of infection and replication in the non-human host 
(termed a "sparkplug variant")* the population of replicated virions can be collected from 
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the non^human host system and subjected to subsequent rounds of genome shuffling with: 
(1) each other, (2) one or more parental viral genomes. (3) mutated genomes (or portions 
thereof) of the collected, replicated virion genomes (i.e;, intentionally mutated genome 
sequences of the sparkplug variants;- such as by spiking v^th mutagenic oligonucleotides, 
error-prone PCR, or other suitable mutational methodology), (4) optimized viral genome 
; components (e.g., viral genes or non-coding regions which have been separately 
optimized for fimction in the nomhuman host), or (5) combinations of the above, in any 

order, among others. - ' 

. ■ In some instances, it may be difficult or impractical to generate the initial 
sparkplug variants by shuffling entire viral genomes; in such cases, it is generally . 
preferred that one or more individual components of the viral genome (e.g., genes, non- 
coding regulatory sequences, replication origins, essential structural sequences) from a 
parental virus are optimized for functionality in the non-human host.; A parental virus (or 
collection thereof) can then be shuffled with one or, more, optimized individual 
i component(s),includmg,ifdesired, multiple spedes of an optimizedcomponent, to 

generate a population of shuffled variant viral genomes which incorporate viral genome 
portions that have been optimized for fimction in the non-human host, 
approach to the shuffling method can be referred to as a "bottom-up" approach, i^^ 
contradistinction to the variation where genomes. of existing viral isolates are shuffled to ' 
0. createatleast^ne-sparicplugvariant-thatre^^^^^ 

down" approach). . ' 

■ In some instances, it may be desirable or necessary^ t^ 

evolve with the virus), by recursive sequence shuffling and selection, a non-human host 

or genetic componeiits thereof.. 
25 . . . . . ! , For exairiple and not to limit the inveiition,HIVr l has b«^ 
^ require certain human pfote^^ CXCM, and the, like) for infectivity of ■ 

cells (see, W097/28258; Moore and Trkola (1997) AIDS Research and Hunian 
Retroviruses 13: 733; and Bour et al: (1995) Mirmhiological.Reviews 59: 63), and 
possibly certain human genes for efficient replication in non-human cells (Hart et al. 
30 (1989) Science 246: 488). For example, a mouse (or mouse cell) harboring a transgene 
which expresses one or more species of human protein involved in fflV-1 infection 
and/or replication (e.g.. CD4, CCR5, CXCR4. or a tat transgene that has been evolved to 
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efficiently promoter transcription of HIV- 1 in mouse cells) benefits firom optimization for 
fimction in the non-human host; recursive sequence shuffling and selection can be used to 
generate optimized variants of such transgene(s). Host organisms or host cells harboring 
transgenes which exhibit some level of functionality (e.g. , ability to be infected with 
5 and/or replicate virus) can be selected for, the transgene sequence (or portion(s) thereof) 
recovered, and the recovered transgene sequence then shuffled with other such recovered 
transgene sequences and/or intentionally mutated transgene sequences to generate a 
population of shuffled transgene sequences that can be used to reconstitute transgenes that 
can be transferred into a subsequent generation of non-human host organisms or cells for 
10 one or more further rounds of selection for virus rephcation and shuffling, and so on. In 
certain enibodiments, the directed evolution of the viral variants and the directed 
evolution of the transgene sequences of the non-human host can be done in parallel, if 
desired, so as to co-evolve a virus variant/host variant combination with optimized 
function to support virus infectivity and/or replication (or other desired feature). 

15 Granularity of Shuffling 

The "granularity" of a shuffling event refers to the relative average density 

of recombination joints per unit length (e.g., per kilobase) or per recombined 

polynucleotide molecule (e.g., per functional viral genome). For illustration, a coarse • ^ 

granularity could be an average of one or less recombination joint per polynucleotide 

20 resulting fi-om a shuffling (i.e., sequence recombination event); a coarse granularity' of 
shuffling generates, a 'low crossover library" (as shown diagrammatically in Fig. 5). It 
is oftesn desirable to alter the granularity of shuffling in different recursion cycles, 
' although this is not necessary in many cases. The granularity desired can frequently be 
selected by the practitioner and is typically accomplished by controlling the degree of 

25 recombination in the recombination format selected (e.g., for a firagmentation/reassembly 
format, a high degree of fragmentation will generate a small average firagment size and 
hence a finer granularity; increasing the number of polynucleotide species shuffled can 
also be used to obtain finer granularity, among other ways apparent to those skilled in the 
art). The average size of segment firom the parental sequence(s) represented in the 

30 library of sequence-recombined polynucleotides is denoted as the "average segment 
length", and may be expressed by unit length (e.g., per kilobase) or as a firaction of the 
parental seque nce (e.g. , one-quarter genome of HIV- 1). . . . . 
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If a mutational Strategy is employed, it is frequently d^^^^ 
gmularitywMch results in an avW segment length wherein, on ave^^^^^^ • 

. (or slightly less) per segment is present.' 

The present method perinits the constructionof a library of shuffled viral 
genomes (or genome portions) wherein the library contains.a population of shuffled 
genomes ofany granularity desired by the practitioner. Libraries prepared from a 
plurality of parental viral genomes can be made to have substantially any granularity ; for 
example a viral genome library having, on average, at least two recombination joints 
■ (e:g., three distinct segments) per sequence^recombined genome can be gaierated, as can 
3 -yiral genomes having three, four, five, six, seven 

: joints (e.g., a viral gfehomic polynucleotide composed of 4,5,6,7,8,9,10, or 1 1 or more 
distinct-sequence segments).. Fig. 5 shows a schematic representation of high crossover ' 
(fine grariularity) and low crossover (coarse granularity) libraries. . 

■ Spiking . ■ ■ 

J.': : The basic sequence shuffling methodology can be used to shuttle a .. . 

, <^Uection\of related-sfiqupnces; wherein mostot ali of the related sequences, substantially 
• spahacertainphysicalpdrtionof ageneorgenome(e.g:,.astmctur4^^^^^ 
: transcHptional regulatory sequence, a replication origin, or For ; 

exainple, the coUectidn of related polynucleotides could represent, e.g. , alleles of a gene 
50 locus, variant viral genes, or genomes of distinct virus isolates^ 

;, embodiments it isdesirable to focus evolutionary ^^^^^^^ 
discrete-segments of a genomic polynucleotide (e.g. a specific viral gene) 0^ 
' particular gene (e.g., on a specific fiinctibnal domain or conserved sequence of a gene). . 
One methodblogical.mbdification to foetid 
25 genome is to "spike" a recombination reaction with additional pol^^^^ 

: represent ohly a subset ofthe locus being shuffied., these,"spiking polynucleotides!' can ^ 
enhance the potential sequence diversity at the locus subset (e.g., randomly or 
pseudorandomly increase mutation density at the locus subset), or can over-represent (or 
. under-represent) certain predetermined sequences in order to steer the sequence diversity 

30 in a predetermined direction (e.g.. to over-represent mutations which tend to produce a 
beneficial result based on prior results). Fig. 6 shows a schematic portrayal of a spiking 
embodiment. A spiking variation of the basic method of shufflmg can be exemplified by 
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a recombination reaction in which several HTV-l clinical isolate, genomes are shuffled, 
and a spiking mixture composed of subgenomic sequences (e.g., mutated Tat .gene 
sequences) are included to produce a resultant shuffled hbrary ' of HIV- 1 genomes having 
enhanced sequence diversity at the Tat locus. In some embodiments, the spiking 
5 polynucleotides are viral genome components which have been optimized separately for a 
desired phenotyjpe (e.g., functionaUty in'mouse cells) and are being shuffled into a 
collection of viral genomes to introduce said desired phehotype into the viral genomes. 

Backcrossing 

... After a desired phenotype is acquired to a satisfactory extent by a selected 

10 shuffled viral genome or portion thereof, it is often desirable to remove mutations which 
are* not essential or substantially important to retention of the desired phenotype 
("superfluous mutations"). Superfluous mutations can be removed by backcrossing, 
which is shuffling the selected shuffled viral genome(s) with one or more parental viral 
genome and/or naturally-occurring viral genome(s) (or portions thereof) and selecting the 

15 resultant collection of shufflants for those species that retain the desired phenotype. By 
employing this'method, typically in two or more recursive cycles of shuffling against 
parental or naturally-occurring viral genome(s) (or portions thereof) and selection for 
retention of the desired phenotype, it is possible to generate and isolate selected shufflants 
which incorporate substantially only those mutations necessary to confer the desired 

20 phenotype, whilst having the remainder of the genome (or portion thereof) consist of 
sequence which is substantially identical to the psurental (or wild-ty^ As 
oile example of backcrossing, an HIV-l genome can be shuffled and selected for the 

: * ^ -^c^acity to substantiallyinfect^m^ ; 

shufflants can be backcrossed with one or more genomes of cliiiical isolates of HTV-l. and 

25 selected for the capacity to retain the capacity to infect and replicate in mouse cells. After 
several cycles of such backcrossing, the backcrossing will yield HIV-l genome(s) which 
contain the xnutations necessary for replication and infection of mouse cells, arid will 
otherwise have a genoinic sequence substantially identical to the genome(s) of the clinical 
isolate(s) of HIV-l. 

30 Isolated components(e.g., genes, regulatory sequences, packaging 

sequences, replication origins, and the like) can be optimized and then backcrossed with 
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parental sequences so as to obtain optimized components which are substantially free of 
superfluous mutations. - . . 

Transgenic Hosts • ' 

For some embodiments of viral genome shuffling wherein a desired 
5 product is a non-htlman model system for a viral disease. it is sometimes necessary to 
introduce one or inore transgenes (or replacement genes - if by homologous 
recombination to replace an endogenous non-human gene with a human gene) that encode 
a human protein important for substantial infection, replication, and/or production of 
infectious virions, of the subject virus. For example, mice or mouse cells transgenic for 
1 0 expression of human CD4 and human CCR5 (optionally also human CXCR4) can be used 
as hosts for shuffled HIV- 1 genomes.. Other primate (including human) Cytpkihe 
receptor proteins which facilitate HIV infection (e.g., attachment and/or entry into a host 
cell) can be expressed from a transgene or expression vector in a host: cell or .organism, , 
. typically in combination with human CD4 and/or human GCR5 and/or human CXCR4. 
15 . inoneaspectofthe invention, transgenic nonhuman animals. Such as 

mice, bearing a transgene encoding a human cytokine receptor polypeptide and/or a 
human CD4 polypeptide are provided. Such transgenes may be homologously 
recombined into the host chromosome or may be non-homolpgously integrated. , 
Additionally, a human cytokiiie receptor (and/or human CP4);cDNA or genomic gene 
20 . copy may be used to construct transgenes for expressing cytokine repeptor (or CD4) 

polypeptides at hi^h levels and/or under , the transcriptional control of transcription control 
sequences which do not naturally occur adjacent to the cytokine receptor (or CD4) gene. 
For example but not Umitation, a constitutive promoter (e.g., a HSV-tk or pgk promoter) 
or a cell-iineage specific transcriptional regulatory sequence (e.g. , a GD4 or CD8 gene 
25. promoter/enhancer) may be operably linked to a encoding polynucleotide sequence to 
form a transgene (typically in combination with a selectable marker such as a neo geiie 
' expression cassette). Such transgenes,,which can include minigenes, can be introduced, 
into cells (e.g., ES cells, hematopoietic stem cells) and transgenic cells and transgenic 
nonhuman animals may be obtained according to conventional methods. Transgenic 
30 cells and/or transgenic nonhuman animals may be used: (1) to.screen for desired HIV-1 
variants capable of infection and/or replication in non-human (e.g., mouse) cells, (2) in 
conjunctipn with a mouse-adapted HIV-1 variant, to identify antiviral compounds in an 
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agent library, wherein said antiviral compound inhibits replication of a nonhuman- 
adapted HIV-1 in said transgenic non-human animal (e.g., mouse) or transgenic non- 
human cells, and (3) other uses apparent to those in the art in view of this disclosure. In 
some embodiments, a nonhuman-adapted HTV-l can be used in a non- transgenic, non- 
5 hxunan host, especially when the HIV-1 genome is introduced into the host by a non- 
infective mechanism (e.g., electroporation, Upofection, co-transfection, etc.) and the 
endpoint being studied is a replication phenotype., 

Transgenes and expression vectors can be constructed by any suitable 
method known in the art. It is often desirable to generate coding sequences for CD4, 

10 CCR5, CXCR4, and other, human accessory proteins that aid viral infectivity by either 
PGR or RT-PCR amplification fi"om a suitable human cell type (e.g., a T lymphocyte 
population) or by ligating or amplifying a set of overlapping synthetic oligonucleotides; 
. publicly available sequence databases and the literature can be used to select the 
polynucleotide sequence(s) to encode the specific protein desired, including any 

15^ mutations, consensus sequence, or mutation kernel desired by the practitioner. The 
coding sequence(s) are operably linked to a transcriptional regulatory sequence (e.g., T 
cell lineage-specific promoter/enhancer) and, if desired, an origin of replication (e.g., 
EBV on) for episomal replication, or one or more flanking sequences having substantial 
sequence identity to a host chromosomal sequence to provide for homologous 

20 recombination and targeted integration of the transgene. In an embodiment, a.transgene 
comprises a human CD4 minigene or a substantially complete human CD4 gene. Similar 
transgenes comprise a CCR5 and/or CXCR4 minigene or substantially complete gene. 
The transgenes can use the native gene transciptional regulatory sequences, or can - 
employ an operably linked heterologous, transcriptional regulatory sequence (e.g., a 

25 mouse CD4 promoter/enhancer, a CMV promoter/enhancer, a human.T cell receptor-gene 
promoter/enhancer, and the like). Often the transgene(s) and expression vector(s) will 
fiirther comprise a reporter gene or a selectable marker gene (e.g., tk, neo) in a selection 
cassette to facilitate identification and enrichment of cells having the construct in 
fimctional form. 

30 A wide variety of altemative transgene constructs suitable for expressing a 

human CD4, CCR5, and/or CXCR4 protein in non-human cells or animals vnll be 
apparent to the skilled artisan. 
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The transgene(s) and/or expression vectors are transferred into host cells, 
pluripotent embryonic stem cells, or embryonic pronuclei by a suitable method, such as 
for example hpofection, electroporation, microinjection, biolistics, viral transduction, 
calcium phosphate precipitation, or other method. Stable trarisfectant host cells can be 
5 prepared by art-known methods, as can transgenic nonrhuman animals. Typically, 

germline transmission of the transgene(s) in the transgenic non-human animal is obtained 
and the resultant "founder mice" are bred by conventional breeding to generate the 
desired progeny (e.g., a huCD4 transgenic male can be bred with a huCCRS transgenic 
female an offspring which are doubly transgenic: -CD4/CCR5 - are identified by Southern 

10 blot analysis of tail DNA). By obtaining the necessary collection of founder mice and 
interbreeding with screening for desired multiple transgenic genotypes, it is possible to 
generate a huCD4/huCCR5/huCXCR4 triple transgenic animal or any other desired 
combination of viral coreceptor proteins. Alternatively, a multiple transgenic can be 

* obtained by a single transfection with multiple transgene species (linked or unlinked), or 

15 by sequential rounds of transfection and selection in„ES,cells to producie a pluripotent ES 
cell havingthe desired collection of transgene species. 

Transgenic mice harboring germline copies of transgenes , that encode and 
express human CD4, human CCR5, and human CXCR4 on.lymphocytes or other cells ^ 
(e.g., macrophages) are produced by these methods. . These human viral coreceptor-" 

20 bearing mice caii be used to evolve an HIV^ 1 or HIV -2 viral genome to replicate ^ 

efficiently in mouse cells and to produce infectious virions. Similar transgenic mice can 
be generated for other virus types; the mice bear a transgene that expresses one or more . 
cell surface protein(s) that serve as a receptor for the virus of interest in a suitable cell 
' . tyipe (e.g' , lymphocyte,' lung epithelial cells, nasopharyngeal cells, squamous epithelium, 

25 etc.). These transgemcniice have at least one ceU type which is competent for in 
by the virus of interest. 

' In a variation, a transgene can be used to alter the cell type(s) competent 
for infection by a virus. For example, a virus that is tropic for lymphocytes bearing a 
particular receptor protein (e.g., CD4) may infect non-lymphocytic cells (e.g., 

30 hepatocytes) that express CD4 under transcriptional modulation of a cell type-specific 
transcriptional regulatory sequence (e.g., an albumin promoter/enhancer region). In this 
manner, a transgenic non-human animal can be generated which has a "chimeric cell- 
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type" having differentiated characteristics of multiple cell types (e.g., a CD4+ cell that is 
predominantly a hepatocyte). A wide variety of chimeric cell-types can be generated by 
the skilled artisan, both as transgenic non-human animals and as cultured cell strains or 
cell lines. 

5 Transgenic host cells and/or transgenic non-human animals can support 

infectivity by and/or replication of a virus which does not naturally infect or replicate in 
the non-human animal and/or cell-type. In a broad aspect, a transgenic non-human host 
cell or organism is generated so as to express a xenogeneic protein, or plurality of protein 
species, that function(s) as a receptor for attachment or entry of a virus which has a 

10 natural host range that does not include the non-human host animal. Similarly, the 
transgenic non-human host cell or animal can comprise a transgene which directs 
expression of a receptor protein to cell types or developmental stages which do not. 
normally express said receptor protein, thereby permitting the virus of interest to infect 
cell types outside the natural cell tropism of the virus. 

15 Transgenic mice, rabbits, rats, and hamster cells with sequences from 

human chromosome 11 and/or 12 are especially preferred for propagation of HIV-l and 
SHIV chimeric virus variants. Often, such transgenic non-human animals harbor a 
transgene, or multiple transgehes, encoding the expression of human CD4, human CCR5, 
and/or human CXCR4 in T lymphocytes or other cells. Transgenes encoding the 

20 expression of other human proteins can be similarly constructed and transgenic animals 
produced therefrom. 

Bridge Cells and Bridge Organisms 
_ . . -In some cases,- the desired non-human host cell or organism may be 

incapable of supporting replication of the virus in part because the desired host (e.g., 

25 mouse) is too distant phylogenetically from the natural host (e.g., human). The desired 

host may lack certain proteins necessary for replication of the virus, or may have 

equivalent host cell proteins which are too divergent from the natural host protein(s) in 

order to function effectively with the virus that has naturally evolved to function in its 

natural host. In such instances, it may be impossible to generate sparkplug variants by 

30 , directly transferring shufflants of primary viral isolates into the divergent non-human host 

cells or organisms, and alternative strategies to adapt the virus to grow in the desired non- 

himian host will need to be \ised. 
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One such strategy is to employ bridging host cells. Bridging host cells, 
and organisms are sufficiently close phylogenetically to the natural host so as to support 
some detectable levdofreplication and/or infeptiyity ofthe virus or mutated.,^^^^ ' 
shuffled variants thereof. The bridging host often is interaiediate in terms of 
phylogenetic distance between the natural host and the desired non-human host cell or 
organism. The capability of a bridging host to support vims replication can afforf 
practitioner with a suitable; non-human. host to support one ormore rounds of shuffling 
and selection that 'can be used to evolve the pool of variant vimses away fi-om rigid 
requirements for certain .unique stmctural features of imponant host factors^m 
host. For example, an Hiy.4 vims which does not replicate substantially in mouse cells . 
even when the mouse cell is modified so as to suppon infection by !he vims can be. 
adapted by passaging in a suitable bridge host (such 3s non-huma.ri primate 

competent to support HTV-1 replication), .typicklly in cGrnbmauon with recursive 
■sequence shuffling and selectionfor thephenotype of enhanced replication in the bridgiiig. 
host, or in a plurality of bridging hosts (to select for broadened host range, rather than jyst 
' chaiiged host range), In manyxases. the bridging host cell or organism is preferab^ 
; transgenic for a transgene encoding a protein ofthe natural host which is import^^^ 

substantial infectivity and/or replication of the vims (e.g., with respect to HIV-1 this can 
be a transgene expressing human GD4,CGR5, anchor GXCR4).particul^^^^^ , 
bridging host ceir naturally lacks a cognate (or functional equiv^^^^^^ 
After one or more cycles of passaging and/or sequence shuffling and phenotype selection 
forvimsvariantsthathaveevolvedtogrowihthebridginghost,anon-^^^^ 
... bridging host cell or organism can often be used since the vims may be^^^^ 
adapted to.the bridging host that the transgene-encoded protein i^ 

substantial infection and replication. ' 

rhimpric Vimses . ' 

In order to employ bridge hosts, it can frequently be advantageous to 

modify the subject viral genome by incorporating one or more portions of a genome of a 

virus which is known to replicate in the bridge host and which of similar type to the 

) subject viral genome (e.g., similar viral taxonomic classification), which can be called a 

"bridge vims". By incorporating a portion of a subject viral genome with a 
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complementing portion of a bridge viral genome, a chimeric viral genome which is 
capable of replication in the bridge host is created. 

Most frequently, it is useful to incorporate those portions of the bridge 
virus genome which are believed to encode functions that are substantially distinct 
5 between the subject viral genome and the bridge viral genome. These regions can be 
identified by highly divergent sequences between the two viral genomes, or can be 
regions containing genes known or believed to be important in controlling host range 
(e.g., surface glycoproteins such as the env gene of a retrovirus such as HIV-1). Often 
such critical genes are (1) viral glycoproteins, (2) polymerases or other transcription 

10 factors which must interact with host proteins or polynucleotides, or (3) viral non-coding 
sequences or secondary structures which must interact with host proteins (e.g., HIV-1 
TAR hairpin sequences). 

For example, if the subject virus is HIV- 1 and the desired bridge host is a' 
non-human primate, it is often advantageous to incorporate portions of a simian * 

15 immunodeficiency virus (SIV) viral genome to create a chimeric HIV/SIV viral genome, 
termed a "SHIV" viral genome. Kuv^ta et al. (1996) AIDS 10: 133 1 describe chimeric 
SHrV viruses composed of gag, pol, vif, vpx! nef, and LTR from STVmac and vpr, tat, , 
rev, vpu, and env of various HIV isolates. Chimeric viral genomes can be created by 
mixing predetermined portions of each genome on the basis of intelUgent prediction of 

20 their functionality in the bridge host (as per Kuwata et al), or the chimeric viral^genomes 
can be created by shuffling all or portions of each viral genome with the other viral 
genome and selecting shufflants which possess the desired phenotype, which is typically 
- enhanced replication in the bridge host. A variation employs chimeric oligonucleotides 
as PGR primers, wherein the chimeric primer has a first portion complementary to a HIV 

25 sequence and a second portion complementary to a SIV sequence to generate by PGR 
shuffled SHIV variants wherein the recombination junctions are principally the 
boundaries between the HTV sequence and the SIV sequence in the chimeric primers (see, 
Fig. 4). In this way, recombinationjoint location can be biased according to the 
practitioner's choice, which may be random, pseudorandom, or intelligent. The present 

30 invention thus provides for a collection of shuffled chimeric viral genomes which can 
then be subjected to selection for a desired phenotype. 
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Once a chimeric viral genome that replicates in the bridge liost is . 
established, forced evolution of the chimeric viral genome by the disclosed shuffling and 
selection method is employed. . In order to yield an evolved v^^^ 
the subj ect vinis that is started with and incoiporates the mimmar 
necessary to replicate in the bridge host, it is often desirable to backcross the.seleeted . 
chimeric viral.genomes with the parent subject yiral.genome. The backcrossing.is 
typically perfonned by shuffling with the parent subject viral genome and selection for 
retention of replication in the bridge host; frequently several iterative cycles of 
backcrossing are conducted. Alternatively,, or in combination with the above, 
backcrossing t6 the parent subject viral genome emi be done after the chimeric virus 
variants have been fiirther adapted for growth in the final ^i.e., non-bridge) host cell or _ 

organism., . . : • ■.' j ' . ■ ' 

• , ■ ' With regard to adapting fflV-l for growth in rodent cells, a chimeric SHIV 
genome or collection of SfflV genomes can be made by recombining one or more HIV- 1 
genomes or portions thereof with a SIVmac genome or complementing portion 

The resultant SHIV genomes can be propagated ancL'or: 

cells, such as macaque monkey cells (e.g., lymphocytes). Chimeric virus variants which 
. replicate in.the bridge cells are then selected (e.g.,;by harvesting virions or by recovering 
proviral DNA-from cells in which a reporter system indicates a replication phenotype) 
and shuffled with each other and/or with the parental subject viral genome: (e.g., HIV-1), 
- and optionaliy mutagenized, and the shufflants are subjected to oi^e or more subsequent 
rounds of phenotypic selection or screening. : , : - . 

' For illustration and not limitation, a SHIV chimeric viral genome is . 

generated and rescued as iMectious virions. TheSHIV virioiiS; are used to infect a , 
monkey cell line that is susceptible to SIV- infection,- and which optionally may express a 
' transgene encoding a huiiian CD4, CCR5, "and/of GXCR4 protein: The infected cells are 
incubated for a period of time suitable to permit replication, packaging, and egress of 
replication competent SHIV virus (e.g., an incubation period suitable for production of 
SIV virions following SIV infection). Replication competent SHIV genomes are 
obtained either as virions or, if a repUcation phenotype reporter is present, as proviral 
DNA from monkey ceUs which exhibit a replication phenotype.. The recovered SHIV 
genomes are then shuffled with each other, and optionally with other HIV and/or SIV 
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sequences, by a rc;cursive sequence recombination method (e,g., 
fragmentation/reassembly foirnat, template-switching, and the like) to produce shufflant 
SHIV genomes. Optionally, a mutagenic process (e.g., error-prone PCR, chemical 
mutagenesis, spiking with mutagenic oligonucleotides having random or pseudorandom 
5 sequence variation) is performed on the recovered SHIV genome sequences, either 

before, during, or after the shuffling step. The shufflants are rescued as infectious virions 
and a subsequent cycle of infection of the monkey cells is commenced. The cycle of (1) 
recovering virions or proviral DNA from cells in having a replication phenotype, (2) 
shuffling and optionally mutagenizing the sequences, and (3) rescuing infectious virions 

10 from shufflant genomes, is repeated until a desired level of replication in the host cells is 
obtained or until replication competence of the shufflants plateaus. After SHIV 
shufflants having the desired phenotype (e.g., improved replication in monkey cells) are 
obtained, they are used to infect mouse cells (e.g., niouse lymphocytes from a transgenic 
mouse expressing human CD4+ on peripheral T lymphocytes), and replicated virions or 

15 proviral DNA from cells having a repHcation phenotype are recovered so as to select for 
SHIV shufflants that are competent to rephcate in mouse cells. The recovered SHI V 
genomes may then be subjected to additional round(s) of shuffling (optionally including 
mutagenesis) and selection to optimize replication in the mouse cells. When a desired 
level of replication is obtained in the mouse cells, the SHIV shufflants are backcrossed to 

20 (i.e., shuffled with) the parent HIV-1 viral genome or a collection of HIV- 1 genomes, ' , 
optionally including a mutagenesis process, and the resultant shufflants are rescued as 
infectious virions and used to infect mouse-cells, and a recursive process of backcrossing 
to parent HIV-1 genome(s) and selection for replication in mouse cells will produ^ " 
chimeric HIV-1 viral genome that is predominantly derived from the parent HIV-1 . 

25 genome and which contams a minimal degree of SIV sequences and/or mutations 
necessary to provide the desired level of replication in mouse cells. 

Evolution of Component Sequences by Shuffling 

The present method of shuffling can be used to optimize subgenomic 

components, such as. structural genes, transcriptional regulatory regions, packaging 

30 sequences, replication sequences, subgenic functional domains, gene clusters, complete 

genomes, and the like), for a particular phenotype (e.g., functionality in a novel host 

species or cell type); The optimized components can then be shuffled into a replicable_ 
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viral genome so as to transfer the phenotypic characteristic to the shuffled viral 
geiiome(s). This approach ("bottom-up") is particularly beneficial when it is unusuaUy 
difficult to obtain a sparkplug variant for a desired phenotype simply from shuffling a 
collection of parental viral genomes which lack the phenotype. . 

Typically , a component sequence (or, collection of related-sequence 

component sequences) is operably linked to a reporter polynucleotide so as to form a 
reporter construct whereby the reporter construct, when transferred into a suitable host 
cell or organism, will report whether or not ,the component sequence exhibits the desired 
phenotype. For example and not to limit the invention, a component sequence of HIV-1 
may be a transcriptioiial regulatory sequence (e.g., TAR, LTR, RRE, INS, CRS), a viral 
regulatory protein (e:g., tat, rev), a viral accessory protein (e.g., vif, vpr, vpu,,vpx, nef), a 
viral glycoprotein (e.g., env), a viral polymerase (pdl), viral capsid protein (e.g., gag), or 



•the like: 



In general, the component sequence to be^ optimized is most commonly 'a 
15 . structural gene.(i.e., a polynucleotide sequence that encodes a polypeptide gene product) 
• or a.transcriptionalregulato:ry sequence (e.g., a viral promb^^^^ _ 

■ ^^a^ptibn temiinator sequ^c^, a secoodary sfructure-forming .sequeiice important for 
. .viral transcription and/or rephcation,,and the The type of readout produced by a 

' reporter construct is usually dependent upon 
20 . Berkhout et al. (1997) Nucleic Acids Res. 25: 940 report a non-shuffling rnethod for • . 
■ forced evolution of the TAR and polyAhairpins of the H^^^ . 
■ destabilize the capacity of HIV to repUcate under different temperatures in the human T 
• . lymphocyte cell line SupTl; similar assay f^^^ 

transcriptional regulatory sequences by.the present shuffling methods. In particular, it is • 
' 25 ■ desirable to optimize HIV-1 LTR, TAR, ahdRM sequences for function in human cells, 

■ . • . and these regulatory ek^^^ 

Tat and/or Rev, respectively. ' 

When the component to be optimized is a structural gene (e.g., tat, rev, 

. env, pol) the readout is typically the improved function of the polypeptide encoded by the 
30 gene in the p^cular host cell or organism selected by the practitioner. However, other 
readouts for optimizing structural genes can also be used; for example and not limitation, 
the readout can be the level of expression of functional-structural gene (e.g.. to optimize 
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codon usage in a host cell, to optimize translational efficiency or RNA processing 
efficiency, and the like). - Improved function of a structural gene product is determined 
by a suitable assay system and selection of such assay system is dependent upon the 
nature of the specific gene product and can be selected by those skilled in the art. For 
5 example and not limitation, an assay to measure function of a viral gene product (e.g., 
pol) necessary for replication of a virus can comprise measiuing the replication of a virus 
genome lacking said viral gene (e.g., pol) in a cell or organism in which the viral gene is 
. encoded by and expressed from an expression cassette that is separate from the viral 
genome (e.g., an expression vector encoding pol); this variation can be termed 

1 0 "complementation in trans' \ A library of such expression cassettes encoding the 
component can be screened for functionality, and functional species selected (e.g., 
enriched for) , and shuffled to produce a selected, shuffled library of shuffled component 
sequences that can be subjected to one or more additional rounds of functional selection 
and/or shuffling, so as to obtain one or more sequences encoding optimized component 

15 species (termed "optimized component sequences", or with specific reference to 
structural genes - "optimized structural gene component sequences"). 

When the component to be optimized is a transcriptional regulatory 
sequence, the readout is typically the improved transcription of a reporter polynucleotide 
(or the polypeptide encoded thereby) in the particular host cell or organism selected by 

20 ' the practitioner. The transcription of the reporter polynucleotide sequence can be 

detected by a method to detect transcription (e.g., PGR, LCR, hybridization with a labeled 
complementary sequence polynucleotide probe, inactivation of a conditionally lethal or 
. screenable gene product by antisense hybridization, and the like),- or by a method to detect 
an encoded gene product of the reporter polynucleotide (e.g., luciferase, P-galactosidase, 

25 HRP, GFP, and other suitable detectable reporter proteins). However, other readouts for 
optimizing transcriptional regulatory sequences can also be used; for example and hot 
limitation, the readout can be the level of expression of a viral structural gene operably 
linked and transcriptionally modulated by the transcriptional regulatory sequence. 
Improved function of a transcriptional regulatory sequence is determined by a suitable 

30 assay system and selection of such assay system is dependent upon die nature of the 
specific gene product and can be selected by those skilled in the art. For example and 
not limitation, an assay to measure function of a viral transcriptional regulatory sequence 
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(e.g., HIV LTR) necessary for viral gene expression and/or replication of a virus can 
comprise measuring the transcription rate of a reporter polynucleotide sequence operably 
linked to the transcriptional regulatory sequence in a cell or organism in, which the viral 
transcriptional regulatory sequence is to be optimized. The reporter sequence is typically- 
encoded by'and expressed from an expression cassette that is separate from the viral \ 
genome (eg., an' expression v^tpr in which the HIV LTR is positioned to. drive 
transcription of a reporter sequence, such as a GFP-encoding sequence). A library of 
such expression cassettes encoding the transcriptional component can be screened for 
functionality, and functional species selected (e.g.,, enriched for), and shufHed to produce 
a selected, shuffled library of shuffled transcriptional component sequences that can be ■ 
subjected to one or more additional rounds of fiinctional selection and/or shuffling, so as 
to obtain one or more sequences encoding optimized transcriptional component species 
(which are a subgenus ofoptimized component sequences). . ' . 

In variations, viral component sequences (whether structural genes or 
regulatory sequences) caii be shuffled with otlier similar type components from related 
viruses (e.g., shuffling an LTR, or portion thereof, fiotn HIV-1 with an LTR from simian 
immunodeficiency virus. SIV) or sequences from animal genomes having similar 
structure (e.g , greater than 50 percent sequence identity) and/o^ 
. (e.g., binding RNA poi n). In this manner, chimeric component sequences can be 
generated and those having a desired phenotypic property (e.g., modulating transcription- 
of .an operably linked sequence in a viral genome) can be selected for after one or more 
iterative rounds of sequence shuffling arid selection. • 

With particular reference to HIV-l, components which naturally occur in, 
overlapping reading frames in the refroviral genome can be optimized for fimction either 
separately (i.e., unconstrained by overiapping reading frames) or in native configuration. 
It will ill some cases be possible to optimize multiple ovarlapping reading.frames ■ • 
concomitantly, whereas in other cases it can be preferable to optimize a component 
separately and then incorporate the optimized sequence (or consensus sequenc^^ 

collection ofoptimized sequences is obtained) into an overlapping reading frame 
configuration if that is desired or necessary for obtaining function or replication of the 
viral genome. However, in some instances optimized components will be used 
separately and need not be suitable for use in a replicable viral genome. 
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Exemplary Components 

A component can be any subgenomic sequence comprising more than 1 0 
consecutive nucleotides of a viral genome, typically comprising all or a substantial 
portion of a viral structural gene, transcriptional regulatory sequence, or replication 
5 control sequence. A component can also be any nonviral sequence of more than 10 
consecutive nucleotides of a structural gene or transcriptional regulatory sequence from 
an animal cell genome (or mRNA pool), wherein said sequence encodes a protein 
involved in viral entry, viral transcription, viral replication, or viral egress, or wherein 
said sequence regulates transcription of a viral sequence (whether as in integrated 

10 provirus or as an episomally replicating viral genome). 

To illustrate the invention and not to limit it, the following non-exhaustive 
' list of viral components can be obtained from an HIV-1 genome: gag MA (PI 7), gag CA 
(p24), gag NC (p7,p6), protease (pi 5), reverse transcriptase/RNase H (p66,p51), 
integrase, Env,(gpl20/gp41), Tat (pl6/pl4), Rev (pl9), Vif (p23), Vpr (plO-15), Vpu 

15 (pl6), Nef (p27/p25), Vpx (pl2-16), Tev (p28),U3 sequence, U5 sequence, primer 
binding site sequence (PBS), polypurine tract (PPT), repeat region (R), long terminal 
repeat (LTR), minimal HIV promoter (NF-kB site, Spl sites, TATA box, transcription 
initiation site, Tat-responsive element (Tar), Rev-responsive element (RRE), splicing 
signals, and other open reading frames or transcriptional regulatory regions of the HIV 

20 genome. Similar components from SIV can often be used, and may supplement or 
replace the cognate components (or portions thereof) of the HIV-l component. 

To illustrate the invention and not to limit it, the following non-exhaustive 
list of nonviral (host cell) components can be obtained from a genome or mRNA pool of 
an animal ceil and are believed important in HIV-1 entry, replication, or egress: cellular 

25 factors that bind to Tar or to Tat, factors encoded on human chromosome 12 that 

contribute to the transcriptional activity of Tat, CD4, CXCR4, CCR5, p561ck, NF-kB, 
Spl, other coreceptors for HTV-l attachment or entry, other host factors necessary for 
HIV-1 replication, and the like. 

Although the examples provided reference HIV-1, those skilled in the art 

30 will be capable of selecting components from the particular virus type they desire to work 
with. 
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Reporter constructs 

A reporter construct comprises a component sequence in a form suitable to 
be transcribed or otherwise acted upon (e.g., bound by repUcation proteins) in a host cell. 
When the component sequence is derived from a structoral gene, the component sequence 

5 is generally operably linked to a franscriptional regulatpry sequence that promotes 
transcription of linked sequences, in the host cell; the operably linked structural 
component and transcription sequence compose an expression cassette, which may be 
present on an episome (e.g., a plasmid, a viral genome, or an artificial chromosome), or 
may be integrated into a host chromosomal locus, or may be non-replicable and non. 

10 integrated. When the component sequence is a viral transcriptional regulatory sequence, 
the reporter construct is composed of a reporter geiie generally operably linked to the 
viral transcriptional regulatory sequence; the operably linked reporter gene and viral 
tnmscrii>ti6n sequence •compose an-exprcssi6n cassette, whichmay be present on an 
• episome (e.g„ a plasmid, a viral genome, or an arti ficial chromosGme), or may be ' 

15 integrated into a host chromosomal locus, or may be non-replicable and non-integrated'. 

Kokenetal. n 994^ Gene 243. describe, a promoter assay based on the 
transcriptional activator Tat of HIV-l; this type of assay can be used in modified format 
to screen for shuffled and/or mutated Tat sequences that can activate an fflV LTR-CAT 

reporter plasmid in a non-^human cell (e.g., a mouse cell). White et al. (1995) 
.20 Science 108:441 disclose a reporter system for real time analysis of transcriptional 

regulation of fflV promoters in mammaUan cells using a reporter system comprising an 
HIV promoter operably linked to a luciferase reporter gene and transient transfection with 
an expression vector encoding HTV-1 Tat; such a system and variations thereof can be 
used to screen for shuffled Tat variants that are functional for fflV-1 promoter activaUon 
25 in non-human host cells. 

■ NeuveutandJeang(1996)iJ^^ 
identifying trans activation of reporter plasmids by Tat, Suitably modified variations of 
the assays described, and similar trans-activation assays for other viral functions (e.g.. 
Rev and RRE function) can be employed to evaluate transcription functionality of such 
30 components in non-human host cells. 

Other variations of suitable reporter constructs will be apparent to those 

skilled in the art. 
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Shuffling and Selection 

A pliirality of speciiss of a component are obtained, either by mutating a 
starting component specie to create a pool of mutated component species or by beginning 
with a plurality of component species (e.g., component "alleles" obtained from a plurality 
5 of virus isolates or even different virus types, such as HTV- 1 and SIV), or other methods. 
The pool of component species can be either be incorporated into reporter constructs, 
introduced into host cells, and selected for a desired phenotype.prior to the first round of 
shuffling, or may be initially shuffled before any selection is performed. 

The plurality of component species is shuflHed by a suitable sequence 

1 0 recombination method (e.g., by DNase fragmentation and PCR-based reconstitution of 
overlapped joints, or by any of the variety of isuitable sequence shuffling methods 
described herein and elsewhere, and as is known in the art) to generate a library of 
sequence-recombined ("shuffled") component polynucleotides. The library of shuffled 
component sequences, typically in the fomi of reponer constructs, are introduced into 

15 host cells by a suitable method (e.g., transfection, electroporation, viral infection, 

lipofection, and the like) and the resultant pool of introduced shuffled reporter constructs 
are selected or screened for the desired fimctionality of the shuffled component , 
sequences. Those library members (or progeny thereof) which comprise shuffled 
component sequences having a desired phenotype are recovered and the resultant pool of 

20 selected shuffled component sequences can be put through one or more additional cycles 
of recursive sequence shuffling to further optimize for the desired phenotype(s), or for 
additional phenotype(s). Mutagenesis and/or spiking can be used in conjunction with 
> shuffling to further enhance the sequence diversity in one or more rounds of shuffling,. . . 
Suitable mutagenesis methods are selected at the discretion of the practitioner, but for 

25 illustration and not limitation can include: site-directed mutagenesis by mutagenic 
oligonucleotide, error-proiie PCR, chemical mutagenesis, mutagenic irradiation, 
propagation of polynucleotides in error-prone hosts, and the like. 

Recovery of Selected Polynucleotide Sequences 

A variety of selection and screening methods will be apparent to those 

30 skilled in the art, and will depend upon the particular phenotypic properties that are 

desired. The selected shuffled viral genome sequences can be recovered for further 

shuffling or for direct use by any applicable method, including but not limited to: 

recovery of virions from cells or extracellular mediiiin (e.g., ascites, serum, spenfcell" 
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culture medium).recovery of infectious particles (as pfti) from cells or extracellular 
medium, recovery of nucleic acids (or PCR-amplified copies thereof) from cells or 
mediuiii, recovery of prbyiral sequences from host chromosomal DNA or P 
copies therebf, recovery of episome (e.g., expression vector) such as a plasmid, cosmid, 
5 viral vector, artificial chromosome; and the like; or other sui 

■ in the art. 

Selected component sequences are typically recovered from the selected 
host cells having the desired phenotypic property (e.g., GFP expression) and can be- 
obtained by harvesting the reporter construct itself (e.g., as an episome or an integrated 
10 chroinospraal sequence), in which case it is frequently desirable to have unique restriction 
sites flanking the component sequence (or variable portions thereoO to be recovered and 
shuffled.- Alternatively, any suitable art-lcnown method, including- RT-PCR .or PCR, can 
be used to obtain the selected coihponent sequence(s) for subsequent manipulation and 

shuffling. ■ ' , • , 

15 - Rprnnstitutiop nf Shuffled C omponent Sequences ' 

Although selected shuffled component sequences can be used for non-viral 

applications (e.g., using an optimized viral promoter in a non-viral gene therapy vector, 
and the like), a major use of such selected component sequences is the incorporation of 
' the optimized component sequence into a replicable viraU 
20 cDrtibonent sequence can simply be cloned into the ^priate position on an" existing 
. viral genome, typically in place of the naturally-occuning component (e.g., an optimized 
Tat is used to replace the native Tat gene in HW). Alternatively, ^ 
component (or collection thereof) can be shuffled with a viral genome 

' , genomes wherein the naturally-occiirring component sequence is substantially replaced 
25 byoneormoreoptimizedcomponentsequences.' Anothermethodistosimplyrecreat^ 

viral genome de novo by Slicing together optimized component se^^ and, if 
desiredi native.viral genomic sequences, to produce a synthetic viral genome from a 
plurality of optimized sequences. Such reconstituted viral genomes can be subjected to 
one or more rounds of selection and/or shuffling. Since components which are optimized 
30 in non-viral reporter systems may not be necessarily be concomitantly optimized for 
function in a viral genome, it can often be necessary to optimize tiie selected 
component(s) in a replicable viral genome (e.g., a wild-type viral genome) so tiiat the 
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component is optimized for function in the context of a replicable viral genome (i.e., a 
context-optimized component). 

Rescue of Infectious Virus f"om Cloned Viral Sequences 

One objective of the general method of shuffling viral genome sequences 

5 to produce shuffled sequences encoding a desired phenotype ultimately is the generation 

ofvirus variants that exhibit altered host range and/or cell tropism. In order to 

accomplish this expeditiously, it is sometinies preferable to employ a system to rescue . 

infectious virus from cloned viral sequences. This may often be as simple as transfecting 

a viral genomic polynucleotide into a suitable host cell in which the viral genome can 

1 0 express necessary replicative functions, replicate the genomic polynucleotide, encapsidate 

the genomic polynucelotide, and egress the cell (if appropriate). Sometimes it is : 

necessary to utilize a helper cell line or helper virus to obtain replication and packaging. 

The helper cell line or helper virus typically provides a function in trans (e.g., viral 

polymerase) that facihtates an important step in viral replication and/or packaging. In the 

15 case of some viruses (e.g., negative strand RNA viruses), it can be necessary to form 

. ■ ■ ■ * . ■ ^(J 

appropriate ribonucleoprotein complexes in order to support efficient replication of the *S 

skilled practitioner will select the rescue system appropriate for the particular virus that is 
to be shuffled. 

20 - With regard to. HIV, there are a variety of suitable methods to recover . j| 

. infectious molecular clones, such as integrated or circularly pennuted, non-integrated 
proviral forms, or subgenbmic proviral sequences that can be reconstituted into full- 
leng{h pr6\ams (G^^ (1994)"AIDS Res Hum Retrovir^ ' ' 

(1993) Virology 194: 858; Li et al (1991) J. Virol 65: 3973; Fredriksson et al (1991) 
25 Virology 1 8 1 : 55). PCR can be used to construct infectious molecular clones of HIV- 1 
from full-length provinis. Infectious molecular clones of HTV can be obtained from the 
NIAID AIDS Research and Reference Reagent Program (Bethesda, M 
. pubUcly available source, or can be generated by the practitioner. Salminen et al. (1995) 
Virology 213:80 disclose a rhethod for recovering full-length HIV- 1 provirus DN A fi-om 
30 primary virus culUires by using PGR; this methodology can be used to recover HIV . 

provirus from starting materials (e.g., HIV primary isolates) for subsequent shuffling, and 
to recover proviral DNA from selected HIV shufflants. Landau and Littman (1992) J.^ 
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Virol." 66: 5 110 describe a method for produciiig helper-free retroviruses • 
using packaging cell lines; the system was used to alter cell trbpisra^ of a retrovirus by 
incorporating envelope glycoproteins from other viruses and mutant envelope 

glycoproteins. * 

TTnrrpH F.vnhitinn of fnV-l ' • 

HIV-1 replication, as defined by the production of infectious vinons, . . 

requires the completion of a series of steps,, each of which results from highly - 
sophisticated interactions between viral and cellular factors. Early.events culminate in 
the. irreversible, integration of the viral genome into the host cell chromosome, and late , 
events exploit the host ceUular transcriptional and tr^slational machine^ 

production ofnew viral particles. . . 

Unfortunately, HIV4: does not infect or replicate to any significant degree 
in convenient non-human model systems, such as mice.- Patience et al. .(1997) iMlxe^ 
Medicine 3 : 282 reports pig endogenous retrovirus (PERV) can replicate in human cells. 
. Ifwould be desirable to generate HIV-1 variants which can replicate in rnice or mouse 

■ cells. ■■ .■ ''■ ■ ' ■ - 

The receptors which fflV-l uses to get into cells are normally involved in 

the signaUrig that takes place between cells of the immune system, The molecules that 
bind to these receptors are called chemokines, and the jeceptprs, in tum^ are called ' 
<J0 chemokine receptors. .^The two main families of chemokines are the CCs and the CXCs, 
and likewise there are two parallelfamilies of chemokine receptors, the, CCRs and the 
CXCRs. In addition to using GD4 as a viral receptor, HIV-l uses a receptor belonging to 
the first family, called CCR5, to gain entry to macrophages during the initial infection and 
dunng the many years of asymptomatic phase of the disease. Later,, the virus ihutates 
25 naturally in the infected human host, allowingit to gain entry into T cells through a ^ 
member of the second receptor faimly, called CXCR4V^^^^ 

it is desirable to have a cost-effective non-human model of HIV disease 

and HIV viral replication. One approach to develop such as model is to make a 
transgenic mouse expressing a human CD4, human CCR5, and/or human CXCR4 on T 

30 lymphocytes and other cell types. However, even if they can infect such transgenic 

mouse cells, most or all HPV-l isolates may not replicate efficiently in mouse cells. Thus 

it is desirable to make a "murine replicable HIV" by exploiting recursive sequence 
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recombination of HI V4 sequences and selection for mutant, shuffl 
HIV- 1 sequences that have enhanced function and replicability in mouse cells. There are 
many alternative approaches to making such murine replicable HTV viral genomes by 
shuffling; these alternative variations will be apparent to the practitioner, and some 
5 specific variations are described herein for illustration and not limitation. 

Generation of HIV Competent to Replicate in Mouse Cells 

Viral genomes from HIV isolates can be shuffled with each other, with 

mutated HIV genomes, and/or with SIV or murine-tropic retroviral (ML V) genomes. ' { 

The shufflants can be introduced into mouse cells expressing human CD4; human CCR^ 

10 and CXCR4 and selected for capability to replicate in the mouse cells and produce 

infectious virus that is capable if infecting such transgenic mouse cells. Once a desired 

level of replication of the evolved HTV shufflants is achieved, additional properties may 

be selected for, such as independence from human CD4 by performing additional cycles 

of recursive shuffling and selection on mouse cells expressing CCR5 and/or CXCR4 and - f 

15 lacking human CD4. In a variation, HIV- 1 genome sequences are shuffled with a HIV -,2 ''f\- 
env gene, which is independent from CD4 for viral entry, to produce shufflants that , 
encode an env protein that does not obUgatorily require human CD4 for virus entry. ' ' ' ' 

Such env genes may be chimeras between a HIV-1 env and a HiV-2 env, or may be ' - 

predominantly or exclusively HIV-2 env sequence, and possibly include additional • 
20 mutations introduced as part of the recursive shuffling process. VtV^ 

Backcrossing to Specific Clades or Parent HIV Isolates 

HIV-1 isolates can be grouped according to . phylogenetic sequence 

„ . ^ similarities into-categories referred to in the art as clades (Gao etal. (1994) ATOS - 

Research and Human Retroviruses 10: 1359). Onceamurine-replicableHTV-l shufflant 

25 having a satisfactory capacity to rephcate in mouse cells is obtained,, recursive sequence 

recombination can be used to backcross the replicable HTV variant to one or more^ 

naturally-occurring HTV sequences, such as the wild-type parental backbone(s) firom 

which the HIV variant was derived or to other HIV isolates. By performing multiple - 

cycles of shuffling (backcrossing to a naturally-occurring HIV sequence and/or to a 

30 consensus sequence representing one or more clades), and selection for retention ofthe 

phenotype of replication in mouse cells, it will be possible to make murine-replicable 

variants of essentially any HTV isolate or clade representative sequence. In order to 
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expedite thfe backcrossing process and reduce the number of cycles required, it is often 
preferred to employ a fme-grained shuffling process and to control the ratio of wild-type 
sequences to murine-replicable Variant sequences to assure obtaining replicable shufilants 
having theniaximum wild-type genetic content. . ' , ' 

' In these ways and variations thereof, it is possible to make murine- 
feplicable shiiffled HIV variants of any given virus isolate or clade, or even to make a 
representative.pool of murine-replicable variants representing a plurality of clades. 
Sequenciiig of resultant murine-rephcable variants and computer-assisted sequence ■ 
comparison to starting viral genomes and backcrossed viral genomes will permit 
';identifi^tidtt df particular mutations or mutation pliistcrs that are-resporisible^rdt. . 
replicability in mouse'cells. ■ 
Miilticlade Mixing 

' • ■ ■ ' , Recursive sequence shuffling can be used to generate collections of 
recbmbiriant HIV variants that represent sequence di versity from a multiplicity of HIV 
. clades. Frequently in dnig development screemng assays it may..be necessary to spreen 
each drug candidate. against each individual clade to assure efficacy for all clades or to 
identi'fy .ihe moldcular'-baas forMst^nce' among resistant clades -or isolates. The . - 

■ ■cdllection of shuffled variants representing multiple clades (or multiple drug resistant ; 

■ variants);can^be used as a^single benchmark to assay for drug resistance and rapidly 
identify the specific sequences responsible for the drug resistance. The pobl of shuffled 
variants are used to .infect-a suitable cell line or non-hunian host animal and the drug is 
administered. Drug-resistant HIV variants will replicate and sensitive variants will not. 
By selecting for resistant variants and performing one or more recursive rounds of 
shuffUrig and syectipn, the variant HIV genomes obtained will represent drug-resistaDt 
genotypes Which can be sequenced and/or backcrossed to parental sequences to identify 
which mutations are conserved tiirough backcrbssiiig and replication selection in'the. 
presence of me drug; such mutations are likely important for tiie drug-resistance 

phenotype. 

Bridging in Primate Cells 

As described supra, non-human primate cells (e.g., macaque monkey cells, 
spider monkey cells) and/or non-lymphocytic cells (e.g., N1H3T3 cells) can be used to 
bridge HIV virus evolution from a human T-lymphotropic virus to variant viruses having 
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other host ranges and/or cell tfopisms. Many non-human primate species can be used as 
a source of cells, which may be propagated in primary cell culture or immortalized by a 
variety of art-known methods. Alternatively, or in combination, with passaging the fflV 
virus (or mutagenized and/or shuffled variants thereof; including SHTV chimeras) in non- 
5 human primate cell cultm-es, it is also possible to passage these viruses in intact non- 
human, primates, and recover the evolved virus variants from tissues or fluids of the 
primates and subject the recovered variants to recursive sequence shuffling and selection 
for replication in the non-human primate. . 

Virus Evolution in a Transgenic Mouse 
10 HIV-1 shufflants can be introduced directly into transgenic mice harboring 

a transgene that.encodes and expresses a human receptor for HIV (e.g., CD4, CCR5, 

CXCR4, etc!), and infective and replicable variants can be recovered from tissues (e.g., 

lymphoid tissues, peripheral blood lymphocytes) or fluids (e.g., serum, ascites) of the 

mouse. The mouse may also have a reservoir of human lymphoid tissue, such as a 

15 SCID/hu mouse with a human thymus/liver sandwich implanted under the kidney 

capsule. The reservoir of human lymphoid tissue can serve as a reservoir of human cells 
competent to replicate shuffled HIV variants such as may repHcate poorly in mouse cells 
at early cycles of a forced evolution to modify host range to include mice. The human 
cell reservoir can amphfy, by replication, the number of variant HIV viruses that can 

20 rephcate in the mouse cells, as well as increase the background of HIV variants which are 
replicating solely in the human reservoir cells. However, since subsequent selections can 
be done with virus recovered from the animal and repUcated in the absence of human 
cells, the iiicreased background of human-specific MV is not problematic. 

Mixed Particle Infection fHieh MOD 
25 Superinfecting host cells at a high multiplicity of infection (MOI) can be 

used to advantage to increase the recombination between viral genomes. Preferably an 

MOI of 5 to 50 or greateris used to enhance recombination during the viral replication 

cycle in the cell. 

Identification of Novel Human HTV Cofactors 
30 Mouse cells non-permissive for HTV- 1 infection can be used for 

expression screening of human cDNA libraries to identify cDNA sequences that encode 

proteins which confer permissivity to HIV-1 infection and/or replication. In an 
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embodiment, a mouse lymphocytic cell line expressing a human CD4l ?i human CCR5, 
and a human CXCR4 protein is non-permissive for HIV- 1 infection and substaiitial ^ 
replication. , The cell line contains a reporter system to report the presence of a 
replication phenotype (e.g., GFP operably linked to a Tat-dependent promoter). The cell 
line is transfected with an expression library encoding a plurality of human^ cDN As which 
are operably linked to a constitutive transcriptional regulatory sequence (e.g. , CMV 
promoter) and cells which express the reporter indicating infection and a replication 
phenotype are selected and the cDNA expression vector species contained therein are 
evaluated for the ability to. reproducibly confer enhanced infection and replication of HIV 
in the transgenic mouse cells. Selected cDNAspecies are identified as HIV cofactors 
and constitute targets for development of novel antiviral drugs. - The selected cDNA 
sequences can also be incorporated into a tr'ansgene to generate mouse cells (or transgenic 
mice) having enhanced permissivity for HIV infection and repHcation. 

Site-Specific Recombination and Amplification 
^ ' In order to bias recombination to specific regions of the viral genome 

sequence, it is possible to engineer into the viral genome sequence one or more site- 
specific recombination sites (e.g., loxP, fit). By employing a complementing site- . 
specific recombmase (e.g, CRE or FLP recombinase), it is possible to create , , ' 
recombination hotspots to bias the shuffling process. The site-specific recombination 
can be done in vitro, or, by employing a transgenic cell or animal expressing the 
recombinase, can be done in vivo.. Furthermore, a tandemly repeated array of viral 
genomes or subgeriomic sequences, containing site-specific recombination sequerices . 
(e.g., such as a transgene containing tandem repeats of a provirus) can be used to obtain 
intra-array recombination to enhance diversity. 

Often, it is desirable to ampUfy the viral genome sequence(s) to increase 
the amount ofrecombinogenic viral genome sequence present,. In an embodiment, a 
transgene containing tandemly repeated arrays of aproviral genome is operably Unked to 
an expression cassette encoding a selectable marker that can be amplified (e.g., DHFR). 
Selection for amplification (e.g., increasing concentrations of MTX) is applied to the 
transgenic cells to expand the number of proviral genomic sequences present. After (or 
concomitant with) amplification, a site-specific recombinase is activated, either 
transcriptionally or allosterically (e.g., by Ugand-induced activation of an expressed 
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chimeric protein comprising the ligand-binding domain of a steroid receptor and the site- 
. specific recombinase), so as to produce site-specific recombination among the proviral 
genome sequences and thereby effect shuffling and the production of shuffled viral 
genome variants, 

5 Combinations 

Combinations-of the shuffling and selection strategies disclosed herein can 

be used. 

Defective HIV Variants Having Enhanced Safety 

Once shuffled, HIV variants that are capable of substantial replication in 

10 mice are established, one or more viral genomic sequence(s) necessary for the altered host 
range and/or tropism and that function in trans can be deleted from the shuffled and 
adapted viral genome and provided in trans in the host cell or animal (e.g., as a transgene 
expression cassette), so that the host provides the helper function that complements the 
replication of the virus, but the resultant vims that is produced is non-infective for 

15 organisms that lack the helper function. In this way, a model system of a transgenic 
animal providing an internal helper ftmction can be used in conjunction with a 
repUcation-deficient HIV virus to develop antiviral drugs and study HIV disease without 
fear that infectious, replication-competent vims will be produced and infect lab workers 
or escape into human or animal populations. . ,;,v 

20 Attenuation Phenotvpes 

Shuffling can be used to generate vims variants having attenuated 

phenotypes, such as reduced pathogenicity and/or virulence. One general type of such 
~ ' attenuated variants are the temperature-sensitive and/or cold-adapted mutants. In this ~ 

aspect, selection of shuffled variants would select for shuffled viral genomes that 
25 replicate efficiently at reduced (or elevated) temperature. Other attenuation types can ^ 

also be selected for. 

Other Phenotypes 

The present method can be used to generate variant viruses having a wide 
variety of altered phenotypes. Illustrative examples not intended to limit the scope of the 
30 invention are: (1) capability to replicate in a non-permissive cell, (2) host range and/or 
cell tropism distinct firom naturally-occurring wild-type virus, (3) improved virus titer 
(e.g., virulence), (4) decreased pathogenicity and capacity to produce disease, (5) 
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modified infectivity, (6) improved packaging capacity, (7) improved physical/cheniical 
stability of viral particles, (8) improved intracellular stability, (9) improved expression of 
one or mor^ viral genes; (10) modified chromosomal integration, (1,1) altered tissue - - 
specificity and capability to infect preferentially specific organs which the natural virus 
5 does not substantially infect, (12) reduced immunogenicity or virus or viral protein in a' 
host (e.g., a human), (13) improved immunogenicity of virus or viral protein in a host 
(e.g., human), (14) improved function as a biological adjuvant (e.g., to co-express a viral- 
encoded human cytokine), (15) improved fiinctipn as a therapeutic (e.g., capacity to ^ 
induce a general antiviral host response - such as interferon production), and others 
10 pheno types apparent to the piractitioner. ' 

Adapted Virus :Host System 

One use of the present technology is to generate a non-human animal (e.g. , 
a transgenic mouse) that can'support replication of a shuffled virai genome that has been 
adapted for infection and/or replication in the animal (or isolated cells therefrom); such a 
15 transgenic mouse-adapted virus system has many commercial uses, including screening 
, for agents that inhibit viral replication, entry, egress, or other aspects of a viral life cycle. 
Furthermore, these systems can be marketed to research instittitions and pharmaceutical 
organizations as commercial research tools (e.g., like restrictioi: enzymes or 
oncogenically susceptible mice like p53 knockouts). \ * . 

20 Whole Genome Shuffling 

' ' In one embodiment,, the selection methods herein are litiUzed in a "whole 

genome shuffling" format. An extensive guide to the many forms of whole genome 

shuffling is found in the pioneering application to the inventors and their co-workers 

entirted "Evolution of Whole Cells and Orga^ 

> 25 Recombination," Attorney Docket No. 018097^020720US filed July 15, 1998 by~del 

Cardavre et al. (USSN ) . / ' / ^ • 

In brief, whole genome shuffling makes no presuppositions at all regarding 

what nucleic acids may confer a desired property. Instead, entire genomes (e.g., from a 

genomic library, or isolated firom an organism) are shuffled in cells and selection 

30 protocols applied to the cells. As applied to the present invention, entire viral (or host 

cellular) genomes are optionally shuffled to produce viruses with a desired tropism, or 

cells which will support growth of selected viruses. 
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Cjdon Modification Shuffling . - 

Procedures for codon modification shuffling procedures are described in 
detail in SHUFFLING OF CODON ALTERED GENES, Philhp A. Patten and Willem 
P.C. Stemmer, Attorney Docket Number 018097-028500US, filed September 29, 1998. 
5 In brief, by synthesizing nucleic acids in which the co.dons which encode polypeptides are 
altered, it is possible to access a completely different mutational cloud upon subsequent 
mutation of the nucleic acid. This increases the sequence diversity of the starting nucleic 
acids for shuffling protocols, which alters the rate and results of forced evolution 
procedures. Codon modification procedures can'be used to modify any viral nucleic acid 
10 herein, e.g., prior to performing DNA shuffling. This can have the benefit of allowing the 
virus to adapt to a host cell's codon selection, e.g., prior to shuffling. 

Use of RecA 

The frequency of homologous recombination bets^'een nucleic acids can be 
increased by coating the nucleic acids with a recombinogenic protein, e.g., before or after 

15 introduction into cells. See Pati et al, Molecular Biology of Cancer \, 1 (1996); Sena 8c 
Zarling, Nature Genetics 3, 365 (1996); Revet et al, 7. Mol Biol. 232, 779-791 (1993); 
Kowalczkowski & Zarling in Gene Targeting (CRC 1995), Ch. 7. The recombinogenic 
protein promotes homologous pairing and/or strand exchange. The best characterized 
- recK protein is from E, colt and is available from Pharmacia (Piscataway, NJ). In 

20 addition to the wild-type protein, a number of mutant recA-like proteins have been 

identified.(e.g., recA803). Further, many organisms have recA-like recombinases with 
strand-transfer activities (e.g., Ogawa et al, Cold Spring Harbor Symposium on 
Quantitative Biology 561-516 (1993); Johnson & Symington, Mol Cell Biol 15, 
4843-4850 (1995); Fugisawa et al., Nucl Acids Res. 13, 7473 (1985); Hsieh et al., Cell 

25 44, 885 (1986); Hsieh et al, J. Biol Chem. 264^ 5089 (1989); Fishel et al., Prac. Natl 
Acad. Scl USA 85, 3683 (1988); Cassuto et al, Mol Gen. Genet, 208, 10 (1987); Ganea 
et al., Mo/. Cell Biol 7, 3124 (1987); Moore et al, J. 5io/. C/iem. 19, 11108 (1990); 
Keene et al,, Nucl Acids Res. 1 2, 3057 (1984); Kimiec, Cold Spring Harbor Symp. 48, 
675 (1984); Kimeic, Cell 44, 545 (1986); Kolodner et aU Proc. Natl Acad. ScL USA 84, 

30 5560 (1987); Sugino et al., Proc. Natl Acad, Scl USA 85, 3683 (1985); Halbrook et al., J. 
Biol Chem, 264, 21403 (1989); Eisen et al., Proc, Natl Acad, ScL USA 85, 7481 (1988); 
McCarthy et al., Proc, Natl Acad, ScL USA 85, 5854 (1988); Lowenhaupt et al., J. Biol 
^C/re^2M720568"(I989)rExmnples^ofsuc 
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recA803, uvsX, (Roca, A.I., Crit. Rev. Biqchem, Molec. Biol. 25, 415 (1990)), sep\ 
(Kolodner et al., Proc. Natl. Acad. Sci. (U.S.A.) S4, 5560 (1987); Tishkoff et al., Molec. 
Ceil. BioL 11, 2593), RuvC (Dunderdale et al., Nature 354, 506 (1 991)), DST2, KEMi, 
XRNl (Dykstra et al.,A/o/ec. Cell. Biol. 11, 2583 (1991)), OT/D^H (ClWkef al., ' 
Molec. Cell. Biol. 11, 2576 (1991)), HPP-l (Moore et al, Proc. Natl. ' Acad. S'ci. (U.S-A.) 
88, 9067 (1991)), other eukaryotic recombinases (Bishop et al, Cell 69, 439 (1992); 
Shinohara et al.. Cell 69, 457. recA protein fonns a nucleoprotein filament when it coats 
a single-stranded DNA. In this nucleoproteiri filament, one monomer of recA protein is 
bound to about 3 nucleotides. This property of rgcA to coat single-stranded DNA is 
essentially sequence independent, although particular sequences favor initial loading of 
recA onto a polynucleotide (e.g., nucleation sequences). The nucleoprotein filament(s) 
can be formed on essentially any .DNA to be shuffled and can form complexes with both 
single-stranded and double-stranded DNA in'prokaryotic and eukaryotic cells. recA 
mediated techniques are also found in WO/93/22443. 

■ Before contacting with recA or other recombinase; fi-agments are 

. optionally denatured, e.g., by heat-treatment. recA protein is then added at a ' 
conceritration of about 1-10 \iM: After incubation, the recA-coated single-stranded DNA 
is introduced into recipient cells by conventional methods, such as chemical ' , 
transformation- or electroporation. In whole cell shuffling techniques, the fragments 
undergo homologous recombination with cognate endogenous genes. Because of the 
increased frequency of recombination due to recombinase coating, the fragments, need not 
be introduced as components of vectors. 

Fragments are sometimes coated with other nucleic acid binding proteins 
that promote recombination, protect nucleic acids from degradation, or target nucleic 
acids to the nucleus. Examples of such proteins includes AgrobacteriumvirE2 
(Duirenberger et ^, Proc:-NailAca<L Sci. [75^86,9154-9158(1989)). Aherhativeiy, 
recipient strains can be deficient in recD activity. Single stranded ends can also be 
generated by 3'-5' exonuclease activity or restriction enzymes producing 5' overhangs. 

The broad scope of this invention is best understood with reference to the 
following examples, which are not intended to limit the invention in any manner. The 
following examples are offered by way of illustration, not by way of limitation. 
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EXPERIMENTAL EXAMPLES 
The following examples are illustrative and not limiting. One of skill will 

realize a variety of parameters which can be changed.to achieve essentially the same. 

results. " ' ' , 

5 EXAMPLE 1 : EVOLUTION OF NOVEL PHENOTYPES IN HIV BY INTRA- AND 
INTERCLADE SHUFFLING • 
The diversity of HIV sequences in natural and laboratory isolates is 

utilized to generate a library of recombinant HIV sequences from which strains with 

desired characteristics are selected. These include novel tropisms on cells from species 

10 normally refractory to HIV infection, the use of alternate receptors to enter cells and 

improved replication kinetics. . - 

Method 

Sources of HIV Sequences 

Subgenomic sequences of various regions of the HIV genome are obtained 
15 from: 1) available molecular clones of different HIV strains; 2) PGR using consensus or 
degenerate primers from genomic DNA of chronically infected tissue culture cells; 3) RT- 
PCR of HIV particles from supematants of chronically infected cells or patient fluids. 

.\ wide collection of such sequences are collected from multiple clades of 
HIV. These subgenomic HIV sequences are cloned into bacterial plasmids that will be 
20 used as templates. 

DNA Shuffling 

Shuffling is performed either by: 1 ) Directly performing circular shuffling 
of plasmids carrying analogous regions of the HIV genome from different isolates; or 2) 
Pooling PGR fragments amplified from plasmids carrying analogous regions of the HIV 

25 from different isolates genome and performing linear shuffling. 

Shuffled material is amplified using primers incorporating specific 
restriction sites. These restriction sites enable the shuffled amplified fragments to be 
functionally cloned into the backbone of an infectious HIV clone (pNL4.3) containing the 
remainder of the HIV genome as in the case of MLV fiill length reconstruction where the 

30 Moloney MLV clone provided the backbone {see infra). This reconstitutes a full length 
HIV clone. A library ofrecombinant shuffled HIV clones are thus constructed. The 
library is propagated and amplified in £. coli to obtain DNA for transfectioh. 
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. . Tissue Culture . ' 

■The library is initially transfected into cells permissive for HIV growth 

such as.CEM. Supematants from this transfection are then used to. infect target cells 

(mouse, CD4- etc) The target cells are passaged for some time .( 2-3 weeks) to allow the 

spread of any viable clones. Also, a mixture of a minority of permissive cells c^ be 

cocultured with a large excess of target cells to allow a basal leyel of replication, of HIV 

strains: This allows any defective, but viable, strain of evolved HIV to slowly spread and 

also to allow further recombination of HIV to take place, at a basal l^^^^ 

Monitoring of Vinis Spread . . 

This is achieved'with assaying for reverse transcriptase activity in the . 
supernatant, release of HIV antigens by ELISA and Westerns arid.the use of reporter cell 
lines containing integrated reporter genes- (GFP ,, B-gal) under the control of the HIV 
LTR. PGR of genomic DNA from infected target cells is also performed. 

Recoverv of Subgenomic Sequences , ' ' ■ . 

When virus spread is detected in a culture of target cells after several 

passages, PGR of genomic DNA from these cells is ■performed to recover the HIV - ; 

, subgenomic sequences. , • 

Sequences from independent cultures are pooled and subjected to ftirther . 

rourids of shuffling, and selection. , ■ ^ 

FYAMPT.E 2: SHT TFFT .TNG CjP RFTROVIRAL G F.NOMES OF MIIRINE LEUKEMIA 

' Viruses ■ ' , 

• , . . ■ Multiple natural strains of MLVs are shuffled to evolve new strains that 
have novel phenotypes not possessed by any of the parental strains. Subgenomic 
fragments of , the different parental MLV genomes, are shuffled and subcloned into an 
25 infectious backbone based on the Moloney MLV genome. Libraries of potentially 

infectious clones are created and screened for novel phenotypes: The initial phenotype 
screehed'for is the abiUty of shuffled clones to infect and' ^^ow" on -several hamster and 
murine embryonal carcinoma (EC) cell lines that are refractory to irifection by any of the 

parental lines. - 
3Q This example demonstrates the effectiveness of DNA shuffling of viral 

genomes to evolve retroviruses to have desired characteristics useful in gene dehvery 
applications. These characteristics will include the ability to iiifect: a) specific cell types 
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efficiently e.g., hematopoietic cells; b) non-replicating cells; c) more stable retroviral 
particles; d) higher titers; and e) site specific integration of delivered genes. 

It also demonstrates the feasibility of building a general library fi-om which 
different novel phenotypes affecting different parts of the viral life cycle can be selected. 

5 MLV Strains 

Without being exhaustive, all or subsets of MLV strains listed below are 

used for family shuffling. ' 



Strain/isolate Class,.. 

. 1) Friend Eco 

10 2)Rauscher Eco 

3) 292EC1. 15 \ Eco . 

4) 292A Ampho 

5) Mo-Ampho ^ , Ampho 

6) Moloney' Eco 

15 7) AKR Xeno?MCF 

8) Gross Eco 

9) Balb V2/BC 169 ' Endo/Xeno 
, 10)Balb Vl/BC 194 Endo/Xeno 
^11) Balb V2 /BC 1 77- Endo/'Xeno 

20 ^ 12) AT 124 Xeno 

13) NZB cl 15. Xeno 

14) AKR 13 MCF 

15) AKR 247 MCFSource 



Proviral DNA 

25- The Moloney MLV was obtained as an infectious proviral clone. AH the 

other strains were obtained as biological clones or preparations and used to infect Mus 
Dunni cells. Genomic DNA firom infected Mus Dunni cells was used as template to PGR 
, . subgenomic^fi-agments of the yarious MLV strains. These subgenomk ,fragme^^^ are thus 
cloned into bacterial Bluescript plasmids. 

30 Shuffling 

A 3 kb region encompassing the 3' 500 bp of pol, the entire env and 3' 

LTR constitute the subgenomic fragment to be shuffled. The DNA used for shuffling is 

derived from in vitro PGR using plasmid clones as templates. Mixtures of PGR 

amplified fragments from all or subsets of the 15 strains are used in the shuffling process. 

35 The fragments are Dnase digested into 400- 1 .5 kb size range. These are reassembled by 
30- 45 cycles of annealing and extension without primers. The assembled mixture is 
then amplified with PGR. These shuffled fragments are then subcloned into the Moloney 

~" backbone and'ti^Esformed imo ErcoliTibra^ Other subgenomic regionsre:g;rthe gag 
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region may also be shuffled for the selection of specific phenotypes (e.g., infection of , 
non-repiicating cells). The whole genome, ~ 9 kb, may also be shuffled either in one 
large segment or as smaller segments separately which are later linked back together into 
the full length genome. 

Screening . 
' - . The plasmid libraries are transfected into cells permissive for retroviral 

replication (at present 293 human embryonic kidney cell line) for the iriitial production of 
viral particles: Supernatants from these transfections are transferred to the target cells for 
selection for<the ability to infect and replicate. Serial passages of these exposed target 
cells will be performed to allow the infection to spread. Marker rescue, reverse 
transcriptase, and genomic PCR assays are used to monitor whether a productive 
infection has taken place. ' 

' , Assays " '■ . ' ■ , ■ ■ • , - ' - , ' 

• ^ Marker Rescue . : . ' ' . 

■ • Supernatants are passaged through cells contaihing an integrated P- 

gal/Neomycin resistance vector. These .will then be used to infect target cells which Avill 

then be assayed for the expression of the marker(s). 

, . Reverse Transcriptase ■ 
. .. ■ Supernatants are tredted to pi-ecipitate and lyse retroviral particles. The 

extract is then subjected to an ELIS'A based assay for reverse transcriptase activity. The 
lysis treatment (detergent) is expected to inactivate .any infectious particles. ; 

■ Genomic DNA PCR , . ' . , " ' 

Cells putativelyinfectedwith retrovirus are harvested. A detergent lysis, 

salt precipitation and alcohol precipitation procedure is used lo extract genomic.DNA and 

inactivate any infectious iriaterial. This DNA is then used as, template for the PCR.of , 

su\)geriomic fragmehts ofpr6viral DNA. The preserice of a spreading infection on the 

target cells is indicated by the above assays. Genomic sequences of the optimized virus 

is recovered either by PCR amplifying the cellular genomic DNA of the infected cells or 

by RT-PCR of retroviral particles recovered fixjm the supernatant of the infected cells. 

Secondary Shuffling 

Subsequent rounds of shuffling are performed on recovered subgenomic 
fragments similarly to the first round. 
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Characterization of Desired MLV Clones 

Recovered infectious MLV clones with the desired phenotypes are 
characterized by traditional techniques such as titers, Westerns for viral antigens, reverse 
transcriptase activity, tropism, mapping and sequencing. ^ , 



Cell Lines Used 
Name 

3T3 

SC-1 

Mus Dunni IIIc 
P19 

F9 

Mv-1 Lu 
293 

CHO-Kl 
BHK-21 

DonCHL 
PA317 



Description 
Murine embryo 
Feral Mice 

Feral mice tail fibroblast 
Mouse embryonal carcinoma 
Mouse embryonal carcinoma 

Mink Lung 

' Human kidney embrv'onic 
Chinese Hamster Ovary 
Baby Hamster kidney 

. (Syrian) 
Hamster 

Retroviral packaging; 
amphotropic/ 3T3 



EXAMPLE 3 : CREATION OF ADENOVIRUS HOST RANGE MUTANTS BY DNA 
SHUFFLING 

Tissue tropism is a problem that exists in most of the viral vectors. For 
example, retroviruses do not target specific cells and only integrate into dividing cells. ^ 
Ad2 and Ad5 infect most human cells but do not infect or propagate in l>TOphocytes, 
keratinocytes, and hematological malignant cells. The host range determinants of Ad 
infection include viral and host factors. Cells must have Ad receptors (still unknown) 
and integrins in order to be permissive for Ad infection, and the Ad viruses must have 
appropriate fibers, penton base, and early genes in order to infect and propagate in the 
cells. It has been previously shown that by infecting nonpermissive cells (Vero) with a 
high MOI of Adl2 and continuously passaging the infected cells for many weeks, an 
adapted Adl2 mutant with altered host range (grows well in Vero) can be isolated. By 
^shuffling the viraLDNAvthis adaptation processTis-facilitatedrand-new-host-range-muta^ 
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are isolated. Human adenovirus group C propagate poorly in human lymphoma and ' 
myeloma cell lines, therefore, these cell lines can be used as host targets in the isolation 
of host range mutants. Ad viruses which can propagate efficiently in hematological 
malignant cells are evolved by recursive seqiience shuffling. 

Method " , 

Shuffling Group C Ad viruses (serotypes 1,2, 5, and 6). 

Group e adenoviruses were chosen because.that they are not oncogenic m . 

animal models. The genomes of these viruses also share more than 90% identity, ' 
allowing more efBcientDNA shuffling (recombination) in vitro. The entire genomic 
sequence ofAdS is shuffled using error prone PGR. Adl , Ad2, and Ad6 sequence ^ 
fi-agments are spiked in to increase diversity. . 

■ ' Testing ofwld type Ad infectivity' in leukemia/lym^^^ 

cell lines' '■■ ^ i . 

Marginal permissive cell lines are chosen as the host targets. Complete 

nonpermissiveness may indicate lack of the cellular receptor or integrins, andmay thus 

cause difficulties.in creating a mutant adenovirus that can compensate these ^ ' 

shortcomings. 

Infect cell lines with Ad mutants created by shumng: screen for mutants 

■ which give higher titers. : , ■ . ; ' 

Infectivity of Ad mutants in target cell lines is evaluated by isolating the 

• viruses at different time points after infection and determining their titer by plaque assay 
on a permissive cell line. Mutants with increased infectivity a^e pooled and subjected to , 
recursive cycles of. shuffling and screening to obtain mutanis witli greatly enhanced 
infectivity. 

" ■ Characterizations of the isolated host range mutants , 
. . Molecular characterizations, such as mapping, sequencmg, analyses ot 
viral DNA replication, transcription, and protein synA^^^ Ad viral 

genomes encoding altered host range and cell tropism are obtained. 

t:v ampt F A: EVQt t tttON OF HEP ATTTTS R VIRAL GKNOMF, FOR HUMAN 
30 HFPATOrYTETROPISM^ ^ ^^^^^^^ ^^^^^^^ ^^^^^ ^.^-^1 

genomes for encodmg the phenotype of infection "and propagation in human hepatic tissue 
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curture cells and in woodchucks. These systems will be especially useful for rapid 
screening and testing of new drugs. 

Hepatitis B virus (HB V) infection is the major risk factor in the 
development of chronic hepatitis and hepatocellular carcinoma (HCC). As much as 15% 
5 of the population is chronically infected in areas where this virus is highly prevalent such 
as in eastern Asia and sub-Saharan Africa. A large scale epidemiology study has shown 
that approximately 40% of the male HBV carriers will eventually die of HCC. 

None of the established cell lines is susceptible to infection of HB V 
derived from serum , or produced by HBV-producing cell lines. HBV can only infect 
. 10 primary human hepatocytes and the hepatocytes of chimpanzee. Thus, chimpanzee, an 
• endangered species which is expensive and allows only limited experimentation, 
represents the only available animal model. There is a woodchuck hepatitis virus 
(WHV) which is homologous to HBV and causes chronic hepatitis and HCC in the • ' ' 
woodchucks. However, the pathology of WHV infection in woodchucks is somewhat 
15 different from that of HBV in human or chimpanzee. Thus, the availability of permissive 
cell lines and small animal models, in which HBV can infect and propagate, would be 
valuable for the testing of therapeutic vaccines and drugs. 

The HBV replication cycle involves multiple steps, including virus 
attachment and entry, formation of covalently closed circular DNA, transcription, RNA 
20 packaging and reverse transcription, (+) strand synthesis, and viral assembly and release. 
Many of these steps involve interactions between HBV genome/gene products and those 
of the host. cell. Therefore, the inability of HBV to infect and replicate in culturable 
- human cells and in woodchuck may be caused by multiple blocks, and the number of - - 
mutations required to generate a mutant capable of replication in nonpermissive cells can 
25 be large. This possibility is also suggested by the fact that, despite intensive research in 
this field, so far no such a host range HBV mutant has been isolated. DNA shuffling is 
uniquely suited to obtaining novel mutants with complex genetic compositions which 
require multiple combinations of mutations or existing alleles. Therefore, DNA shuffling 
may be a promising approach to solving the problem of evolving HBV to grow in human 
30 hepatic cell lines and in woodchucks. 
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Method • : ■■ . „ , ' 

irovTas a genome size of 3.2 kb, which is easily and effectively shuffled 

using an in vitro shuffling fonnat. In addition, numerous natural variants of HBV and 
. WHV, whose genomes are highly homologous, are obtained and shuffled to achieve a 
librar^ of increased diversity. This library is transfected into target cells. Viral infection 
and propagation are monitored by reverse transcriptase assay, quantitative PCR of the 
viral genome, and the accumulation of viral surf-ace or core antigen 
" selectiQnin woodchucks, theshuffled HBV library is delivered to liver 

adenovirus. AAV, or nonviral vectors. The propagation ofmutantHBV in liver cells is 
" monitored by the presence of HBV DNA and/or surface or core antigens in the serum, 
and by viral antigens and pathogenesis in the liver. Genomic DNA of positive viral 
clones is obtained by PGR from- the medium supernatant of,the tissue culture cells, or 
' from serum ofthe woodphucks. , Selected pools.of mutant HBV DNA is used for 
recursive shuffling/selection cycles to obtain mutants with improved phenotypes. The 
evolved mutant virus is backcrossed with wild type HBV to eliminate unnecessary and 
silent mutations and to retain the replication and pathogenic properties of HBV. 

t;yamPT.F. S: DlR ^rTFn FVOTlJTIQ NnF A NFW TROPTSM TN^ RETROVIRUS : 

- rMT.V^BYr>NA SHTJFFLING .■ . . . ' : 

DNA shuffling was used to evolve a new tropism in ecotropic munne 

)■ leukemia virus (MLV). A library of shuffled ecotropic envelopes clbnek into full-length 
proviral genomes was selected for the ability to infect CHOK^ A dominant clone 
■ rapidly emerged during selection containing an envelope that was a clear recombinant . 
between three ofthe parental sequences;' This recombinant envelope conferred infectivity 
for CHO Kl cells through a novel mechanism. Our data show that this recombinant 
i5 • envelope was formed as a direct result ofthe DNA shuffling process. ' ; ,. 
■ ■ • : _current,gene transfer vectors suffer^ f^^^^ 

clinical efficacy. . Efforts to improve viral vectors have focused on rationally designed, 
modifications of viral components. Such approaches demand detailed knowledge and • 
understanding ofthe limiting mechanisms. This knowledge is often incomplete and as a 
30 result, these approaches have achieved only limited successes. For exarhple, attempts to 
develop targetable retroviral vectors by incorporating ligand binding domains into the 
envelope result in vectors that can bind specifically but are inefficient at entry (Kasahara, 
N.,Dozy,A.M. &Kan,Y.W. Science 266, 1373 - 1375 (1994)). On the other hand, 
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viruses and particularly retroviruses are notorious for their ability to evol /e their way 
around biological blocks. This process has been utilized in many studies to evolve 
viruses with new phenotypes such as expanded tropism (Vahlenkamp, T. W. et al. 
Journal of Viroloev 7 U 7132-7135 (1997) Taplitz, R.A. & Coffin, J.M, Journal of 
5 Virology 71, 7814-7819 (1997). drug resistance (Balzarini, J. ,et al. Journal of Virology 
67, 5353-5359 (1993) Dianzani, F. et al Antiviral Chem. Chemother. 4, 329-333 
(1993) and promoter activity (Barklis, e., Richard, M. & Jaenisch, R. Cdl 47, 391-399 
(1 986)). Components of evolved viral variants, for example LTR elements have been 
incorporated into improved viral vectors (Robbins, p.B. et al. Journal of Virology 71, 
10 9466-9474(1997)). 

Adaptation of viruses to new host ceils typically requires prolonged 
passaging and selection. This is due to the necessit>' for the continuous generation and 
selection of variants before an effective solution is found. Usually this involves only a 
few mutations. Biological adaptations that require constellations of mutations or novel 
1 5 combinations of functional domains may not be achieved without long periods of 
replication and frequent extinctions. In this example, we demonstrate that DNA 
shuffling can dramatically accelerate viral evolution towards desired phenotypes by 
* enhancing recombinatoriai processes in vitro. 

In DNA shuffling (e.g., .Stemmer, P.C. Nature 370, 389-391 (1994)), ' 
20 similar input sequences are first randomly fragmented. The firagments are then 

reassembled through multiple cycles of self-priming polymerase chain reaction. Because 
of the complementary overlapping ends, a fi-agment fi-om one parental sequence can 
prime off a teitiplate from another parental sequence. DNA shuffling thus generates a - 
population of recombinant sequences which is then screened or selected for improved 
25 phenotypes. The process can be applied recursively to independently selected sequences 
to recombine useful variations, often with synergistic effects. The diversity of the input 
parental sequences can be generated by mutagenic processes or, more effectively, by 
using several natural occurring sequences (Crameri, A., Raillard; S.-A., Bermudez, E. & 
Stemmer, W.P.C. DNA Nature 391, 288-291 (1998)) (natural* diversity). DNA shuffling 
30 thus accelerates natural processes of evolution by the rapid and efficient generation of 
diversity through errors and recombination, followed by selection. Many single and 
multigene systems have been dramatically improved using this process (Patten, P.A., 
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Howard, RJ. & Stemmer, W., P.C. Current Opinion in Biotechnology 8, 724-733 
(1997)). Here we applied the shuffling process for the directed evolution of a new 
tropism in MLV. Using envelope sequences from parental MLV strains that were non- 
infectious for CHO KL(Chinese Hamster Ovary) cells, we were able to rapidly evolve a 
5 chimeric envelope that conferred infectivity for these cells. This chimeric sequence 
represents a novel solution that is different from any known MLV strains that can infect 
CHO Kl . Our results underscore the abihty of DNA shuffling in viral systems to find , 
novel solutions where understanding of the biological limitations is incomplete- ; 

Parental MLV Clones 
10^ Six ecotropic MLV envelope sequences were used as the parental 

, sequences. These were of Moloney, 292E and four Friend (#2,7,9,21) clones. The 

Moloney clone was obtained as the infectious proviral clone pNCA. Thisplasmid was 

modified to have a unique NotI site just do wmstreain of the^'LTR. A 3,2 kb fragment 

. . encompassing about 0:5 kb of pel;, the. entire env and^^^ 

15 with analogous -fragments from the other strains;..* Friend and.292E . sequences were . 

' amplified with appropriateprimers-froniiinfectedMus Dunni genomic. DNA; to generate;;. 

the analogous fragments. Thus, infectioxxs MLV clones were reconstituted, consisting of 

5' Moloney sequencesiand '3' sequences from the other strains.* Table i'v shows the v 

infectious activities ofthe.six parental. clones; after transfection int6:293^ cells .contaim^ 

20. anihtegrated G4 18 resistance vector. (293/Gl): All; six parental viruses didmot infects 

CHO Kl cells and had clearly different 'infectious profiles' on the. various cell types. 

The four Friend clones were isolated from the same Friend biological complex and their 

different infectivities reflect the presence of multiple species in the original stock, This 

was borne oiit by subsequent sequencing of the envelope genes, which confirmed that the 

' 25 . Friend clones were related but distinct to one another. Friend 2,7 and 9 are more closely 

related and may have resulted from the sequential accumulation of mutations while 

Friend 21 is further diverged. 

Library Construction and Characterization 

Sequences between the Sfil and NotI sites of the parental clones were 
30 amplified and shuffled together. Recombinant sequences were then cloned back into the 
backbone vector to generate a library of approximately 1 X 10^. Several independent 
clones were picked and analyzed by restriction analysis as described. Eight out of 24 
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clones exhibited patterns different from any of the parents. This represents a lower limit 
for recombination frequency as many other nucleotide changes may not be detected. To 
assess the viability of the library, 5 pools of 4 clones each were transfected into 293/Gl 
cells. The viral supematants were tested for the ability to transduce G4 1 8 resistance into 
5 3T3 and Mus Dunni cells. Four of the 5 pools were able to strongly transduce G41 8 
resistance into at least one of the cell types. Thus, if each positive pool only had one 
infectious clone, this would give a frequency of 20% (4/20) which represents a lower 
limit for the viability of the library. 

Passaging of Library / Selection 
10 Selection was performed by passaging the shuffled' library supernatant on a 

mixture of CHO KI and Lec8 cells as illustrated in Fig 7 and described supra. A control 

mixture of the six unshuffled parents were passaged identically. A small proportion of 

Lec 8 cells was mixed in during passaging to support a low level of replication in a 

permissive cell type that was as similar to the target CHO Kl cells as possible. Lec 8 

15 cells are CHO Kl-derived mutants whose ecotropic receptors are believed to be more ^> 
accessible because of a defect in their glycosylation pathways. This renders them ftr 
permissive to infection by som.e ecotropic MLVs (See also Wilson, C, & Eiden, M.V.E. 
J. Virol. 65, 5975-5982 (1991); Miller, D.G. <& Miller, D. J. Virol. 66, 78-84 (1992). 
Wang,H. et al. J. Virol. 70,6884-6891 (1996)). Friend.2, Friend 9 and Moloney 

20 MLVs produced from transfected 293/Gl are able to infect Lec 8 cells fairly efficiently -^r 
(Table 1). 

Table 2 shows the progress of the selection for both the control unshuffled 

" ' parents and th'e shuffled library.' By titering at each stage, the changing 'infection profile' 

of the viral population was monitored. The initiating transfections into 293/Gl for the 
25 shuffled library produced supematants that gave titers on 3T3, Mus Dunni and Lec8 that 
were on the order of 10^ fold lower than that for the control parental pool. 

' The infectious activities of both the control parental pool and the shuffled 
library fell to similar levels after one passage on the coculture cells, even though the 
shuffled library started out with 10^ fold lower titers. This indicates that the shuffled 
30 library is actually fitter than the pairental pool imder the coculture selection conditions. 
This point is underscored after a second passage of the viral pools. The parental pool 
essentially becomes extinct after the second passage (extremely low activity can be 
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deteeted pccasionally). The library titers fell to low but detectable levels that remained 
stable through passage 3. At passage 4, when the level of permissive Lec 8 was raised 
from 3% to 1 0%, the. titers rose. Low GHO K 1 infectious activity became detectable at . 
. passage 4 and this increased at passage 5 with the^ ri^^^ 
5 Another particularly informative observation can be made by comparing 

the * infectious profiles' of the control parental pool and the library pool at various stages 
of the selection. The infection efficiencies of the control parental pool after transfection . 
on Mus Dunhi and Lec 8 relative to 3T3 are approximately 1 0-2 and 1 0-3 respectively . \ 
Thisremains relatively constant through passage 1 beforethepopulation 'crashes* at ' ; 

10 passage 2. F6r the libraiy pool, efficiencies for Mus Dunni and Lec 8 relative to 3T3 
increased roughly 10 fold for each passage. From passage 2 onwards^ these ratios ' - 
; approached L It fell back to 10-1 again atpassage S'B when the overall .titers inqreases. 
This implies that essentially every viral particle that can infect 3 T3 at passages 2- 4 can 
infect Lec 8 equally efficiently whereas in the unselected initial u-ansfection pool, there is 

15 only one viral particle that can infect Lec 8 for every 300 particles that can infect 3T3 . . 

Thus there is strong selective pressure for the ability to infect and re Lec 8. The-/ 

parental pool apparently cannot adapt rapidly enough to this selective pressure. The ^ 
evolved virarpopulatipn^surviving the selection has-a low level of inf 
CHO.Kl cells.^ Thus infection of CHO^Kl cells becomes observable only after the 

20 / populatidn recovers from the selection diuirig th^ . 

Recover/ and Characterization and Recombinant Clones 
Genomic DNA was prepared from the coculture cells from various 

passages. This was used as template to amplify provirai sequm^ 

' Methods. Genomic D^f A prepared from passage 5 cells which gave significant CHO Kl 

25 Wectibus activity, produced.a clear product band of the expected 3'.1 kb size. No specific 
product band was observed for passages 1-3 (passage 4 not performed) and for any of 
control parental passages. The PGR products amplified from passage 5 cells were pooled 
and cloned into the Moloney backbone to reconstitute potentially infectious clones. PGR 
Augments corresponding to the shuffled region were ampUfied from individual clones and 

30 screened by restriction digestion. The restriction patterns of none of the clones 

correspond to any of the parental patterns. Furthermore there is a dominant pattern 
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(Clones 1-6, 8, 10-12) that represents the 'master sequence'. Clones 7 and 9 are shghtly 
different from the dominant pattern but are also distinct from any of the parents. , 

Clones 3, 10 and 1 1 , corresponding to the dominant pattem and the variant 
clone 7 were transfected into 293/Gl cells, and the supematants were tested for infectious, 
5 activity (Table 3a). Surprisingly, all of these clones had drastically diminished 

infectivities for CHO Kl when compared to the passage 5 pools from which they were 
isolated. Relative to titers on Lec 8 cells, the infectivities of these clones for CHO Kl 
was on the order of 10'^ or less, 100-1000 fold lower than that for passage 5 supematants. 
This suggested that the *CHO-tropic' clone in passage 5 was not represented by any of 

10 the four clones tested. The infectious efficiency of passage 5 supematants on CHO Kl 
relative to the other cell types is about 10-3-10-2 (range from several subsequent 
titrations).. This could be interpreted in two ways: 1 ) The predominant virus particle in 
this supernatant can infect CHO Kl at an relative efficiency of lO'^-lO'^ 2) there is one ■ 
viral particle in every 100.-1000 infectious particles that can infect CHO K\,' If the latter f; 

15 were true, this rare clone would be expected to be selected for under our passaging regime j: 
and increase in frequency. However CHO Kl infectious efficiency apparently has 
stabilized at 10"'*-10*'^ suggesting the viral population has achieved som.e state of 
'equilibrium'. This is supported by the clear dominance of one clone as shown by . ^ 
restriction analysis. These observations indicated that the clone that conferred CHO Kl 

20 infectivity was not missed, but that this activity was masked in our clones. ^ f 
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Table 2: Titers of parental and shuffled library passage supernatants on coculture cells 





Titer 


Transfect 






Coculture Passage Number 








cells 


- ion 


1 


2 


3 


4 


5 






3T3 


1.2x10' 


7x10' 


0 


0 


0 


0 


- ND 


Pnnfrnl 


MD 


3X10'' 


4x 10' 


0 


r\ 
\J 


u 


ND 


ND 


IPjirpntn]*; 




















Lec8 


1.4 XIO** 


30 


0 


0 


8 


0 


ND 




CHOKl 


0 




0 


0 


0 


0 


ND 




3T3 


3x10' ,. 


1x10' 


16 


7 


10' 


2x10' 


10' 


Shuffled 


MD 


10' 


4 X 1,0' . 


20 


30 


45 


ND 


• 10' 


Library 




















LecS 


'9X10'" 


40 


19 


' 19 


■ 10' 


>5xl0' 






CHOKl 


0 


0 


0 


0 


10 


5x10' 





MD : Mas Dunni * from sepancc expcfiment which gave comparable :i:cn on 3T3 
* ND : not done & supcmatanu from laccr cultures split from passa^: 5 



Table 3a : Diminished CHO KlTnfectivity after Growth in 293/Gl cells 







Titer Cells 




CLONE # . 


3T3 


Mus Dunni 


LecS 


CHOKl . 


3 


10' 


10* 


10* 


10* 


' • 7 


-50-- 


.-..-10^. 


10' 


" 0 , 


— —10 — - 


. - - 10'— 


_.__10^ 


10^ • 


b 


11 


• 10' 


■ 10' ~~ 


"lo' 





esc 



^Tiated'from.a single G418 resistant colony in the IC' titration well. . 

^■u rrrrr^^tr^ is reconstituted after passage through LecB/Gl cells 
Table 3b: Lrtu . - 



CLONE # 



3T3 



Titer Cells 
Mus Dunni .Lec8 CHOKl 
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Lec 8 Passaging Effects 

Lec 8 cells were included in the coculture as permissive cells to allow 
some level of viral replication. They have a known defect in their glycosylation pathway 
(See e.g., Stanley, P. & Siminovitch, L. Somat. Cell Genet. 3, 391-405 (1977)) that 
apparently 'unblocks' their ecotropic receptors (Wilson, C. & Eiden, M.V.E. J. ViroL 
65, 5975-5982 (1991); Miller; D.G. & Miller, D, J. Virol. 66, 78-84 (1992); Wang, H/ : 
etal. J. Virol. 70,6884-6891.(1996)), We wondered if the mechanisms that unblock , the 
Lec8 cellular receptors might also operate on virus that are passaged through these cells. 
To mvestigate this, 293/Gl transfection supematants from the viral clones described 
above were used' to. infect Lec8/Gl cells. This was replaced with fresh media which was 
collected after 48 hours and titered (Table 3b). All clones except #7, had their CHO Kl 
infectivities reconstituted to levels similar to passage #5 supematants (CHO Kl infection 
efficiency ranges between 10'^ . -lO'^ due to experimental variations). This suggested that 
diere was a requirement for LecS specific processing of vims before CHO Kl infectivity 
was manifested. Clone #7, which had a different restriction pattern from the dominant : 
pattern, may represent a 'passenger' virus or may have diverged further from the 'master 
sequence' and lost the abihty to infect CHO Kl. Clones 7 and 10 were not analyzed 
further. The erivelope sequences for clones 3 and 1 1 were 

. From Table 1 , it can be seen that the Friend 2, Friend 9 and Moloney MLV 
parents can infect Lec 8 fairly well Thus, it was interesting to determine why the control 
parental pool passaged virtually to extinction so rapidly. To examine this, these parental 
clones were transfected into 293/Gl , the supematants from these were then passaged 
either through Lec8/Gl or permissive murine lines (Mus Dunni/Gl or SC 1/Gl ). The ■ 
superriatants from these yarious stages were titered , (Table 4). Friend 2 and Moloney * 
MLV produced high titer virus on 3T3 after passage through Lec 8. However their 
infectivities on Lec 8 relative to 3T3 were diminished by 100-1000 fold when compared 
to the 293/Gl transfection supematants. Friend 9 either not did replicate well in Lec 8 or 
produced poorly infectious viruses. Supematants from Mus Dunni/Gl or SC-l/Gl 
passages essentially preserved the infectious profiles of the initial 293/Gl transfection 
supematants. As expected, none of the parental supematants could infect CHO Kl 
regardless of how they were passaged. Thus, the parental viruses lose the ability to 
reinfect the permissive LecS cells with each passage. This would lead to the t -•^ 
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and rapid dilution of virus production and to the observed rapid decline of the infection 
during passaging. 

Sequences of Recombinant Envelope 

The envelope sequences of Clones #3 and #1 1 are most consistent, with a 
5 four fragment recombination between three of the Friend parents (Fig. 8). The sequence 
of clone #3 can be explained by recombination alone, while that for clone#l 1 has an 
additional silent base change at position 23 1 . Nucleotide differences between the parents 
allow us to map the regions where crossovers took place. It is not surprising that the 
Moloney and 292E sequences were not included in the selected clones. Recombination 

10 events involving these two parents may be under-represented as they have lower degrees 
of identities with the Friend sequences. Because of their greater divergence, 
recombination events may also have a higher probability of generating non-viable clones. 
Although the 3' LTR and parts of pol were also shuffled, it is unlikely that they play . 
significant roles in the new tropism of the recombinant clones. Pol is highly conserved 

15 between ecotropic ML Vs and is not known to have a role in entry. Cloning of 

recombinant envelope sequences which excludes the 3' LTR, using the Sfi I site in pol 
and a conserved Cla I site towards the end of the envelope is sufficient to confer CHO Kl 
tropism (data not shown). This indicates that the changes in the LTR were not necessary. 

DNA shuffling was used to improve individual genes as well as multigene 

20 pathways. In this example, we report an appHcation of shuffling to evolve a desired 

phenotype in a viral system. The ability to infect CHO Kl cells was evolved by shuffling 
sequences from a defined set of ecotropic parental MLVs. No a priori assumptions were 

- — made ofthe changes required to overcome the CHO Kl entry block other than tliat the - 

~~^^en\^lopCw2^^^ " ~ - ■ ' ^ 

25 Predominantly, envelope sequences from the six parents were shuffled to 

generate a library of about 1 X 10^ clones. At least one third of these were recombinant. 
This shuffled library consistently gave 100 fold lower titers than the parental pool upon 
initial transfection into 293/Gl cells. This is caused by the generation of many lethal and 
debilitated sequences by the shuffling process. Thus the fitness of the naive library is 
30 lower than the unshuffled parental pool This reflects the 'cost' ofthe shuffling process in 
generating diversity at the expense of population fitness. 
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An initial attempt to select directly on CHO Kl cells alone le^ 
extinction of both the library pool and the control parental pool. This indicated that the 
barrier to CHO Kl infection was high enough that an effective solution was not 
sufficiently represented in the initial library. There might have been variants that could 
5 infesct CHO Kl cells, but these were either too rare or were too inefficient that they were 
easily lost during the stringent direct selection. ' To enrich for these possible variants, we 
mixed in a low proportion of permissive Lec8 cells. These cells were chosen as they 
were CHO Kl derived mutants and thus would serve as a relevant 'bridge' to the target' , , 
CHO Kl cells themselves: With the coculture selection, the titers of both the parental 

10 pool and shuffled library pool fell to similar levels after 1 passage, despite the 100 fold 
higher initial titers of the parental pool. By the second passage onwards, there was little 
detectable infectious activity in the parental' pool while in the library pool, infectious ; 
activity was low but clearly present. Thus there were clones in the library that were ' - 
: surviving the selection as a result of the added diversit) generated by shuffling. CHO K l 

15^ infectivity was observed from passage 4 onwai'ds as the overall . titers increased. 

Prpviral envelope seqn^nces cloned from passage' 5 coculture genomic 
DNA revealed a dominant recombinant patterri whose parental origins can be deduced 
(Fig 8), Three of the Friend parents contributed to this recombiiiant enve^^ 
structure is different from any of the pubUshed MLV 'related stradns that can infect CHO 

20 ;K1 . ' These include the 1 OA 1 strain the Gibbon Ape Leukemia Virus (Delassus, S., 

Soiiigo, P. & Wain-Hobson, S. Virology 173,, 205-213 (1989)), and the neuropathbgenic 
Friend variant PVC 21 1 (Masuda, M., Masuda, M., Hanson, C, Hpffinan, P.M. & 
Ruscetti, S.K. J. Virol. ' 70, 8534-8539 (1996)). The two residue changes in PVC21 1 
(ET16G and E129K; numbering according to reference) relative to the published Friend . 

25 . 57 clone do not occur in our recombinant clones or in any of the parents. A Mus Diinni 
Endogenous Virus (Bonham, L., Wolgamott, G: & Miller, A.D. J. Virol. 71, 4663-4670 
(1997)) is also highly infectious for CHO Kl but based on its source and properties is not 
related to our recombinant clone. 

' An* interesting feature is the requirement of our recombinant clones to be - 

30 passaged through LecS cells for it to acquire CHO Kl infectivity. When the clones were 
repUcated in 293/G 1 cells, their infectivity for CHO Kl relative to Lec 8 is 1 0-5 or lower. 
After passage through Lec 8, this increased 1 00-1000 fold to 10-3-10-2. As a result of a 
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mutation in the glycosylation pathway of Lec 8 cells, their golgi can only import 
galactose at 2% of wild type levels, resulting in low efficiency of terminal addition of 
galactose and sialic acid at N-linked glycosylation sites (Deutschert, S.L. & Hirschberg, 
C.B. Mechanism ofGalactosylation in the Golgi Apparatus. J. Biol. Chem. 261,96- 
5 1 00 (1 996). An altered glycosylation pattern of the envelope when expressed in Lec 8 
may be responsible for enhancing CHO Kl infectivity. Glycosylation patterns of 
retroviral envelopes produced in different CHO glycosylation mutant cell lines are clearly 
different (Fenouillet, E., Miquelis, R. & Drillien, R. Virology 218, 224-231 (1996). 
Friend 21 is more divergent than any of the other Friend parents. In the segment that 

10 Friend 21 contributes to the recombinant clones, three amino acid residues (378, 413 and 
447; Fig. 8) that are specific for Friend 21 are positioned 1-3 residues away from N- . 
linked glycosylation sites. These may influence the efficiency of sugar addition which 
may in turn affect the overall conformation of the envelope. Cellular processing and 
conformation of retroviral envelope glycoproteins are known to be heavily dependent on 

15 glycosylation signals. The receptor binding domain (Heard, J.M. Danos, 0. J. Virol. 
- 65, 4026-4032 (1991)) of the recombinant envelope is provided by Friend 2 and Friend 9 
parents, both of which can infect Lec 8 cells. It may be that this receptor binding domain 
in juxaposition with the aUered glycosylation signals from Friend 2 1 is processed in Lec 8 
cells to produce an envelope that is able to reinfect Lec 8 cells and to a lesser degree, to 

20 infect CHO Kl cells. The glycosylation mediated block of CHO Kl receptors can be 

reheved by inhibiting glycosylation in these cells. This may have the effect of making the 
receptors more accessible to the envelope. The same effect might also be achieved by 

under-glycosyiating the retroviral envelope itself This modification of retroviral 

" tropisms by altering the glycosylation pattern of envelopes may represent a no ve "~ 

25 mechanism that has not been reported previously. 

The passage of parental viruses produced from 293/Gl through Lec 8 
results in poor production of infectious viruses (Friend 9) or in progeny viruses that 
cannot reinfect Lec 8 efficiently (Friend 2 and Moloney). This may be a direct result of 
the altered glycosylation pattern of these parental envelopes in Lec 8 cells. Under- 

30 glycosylation.of the Friend 9 envelope may lead to gross misfolding while for Friend 2, 
and Moloney, this may lead to conformational changes that resuU in the inability of the 
envelope to bind the Lec8 receptor efficiently. The rapid abrogation of the parental 
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infection after two passages on the coculture cells can be explained by the progressive 
loss of the ability of the parental viruses to reinfect the permissi ve Lec8 cells. 
Recombination during shuffling between various parental segments generated clones that 
circumvented this restriction and werq therefore selected for. It is unlikely that in vivo 
5 retroviral recombination had a significant role in generating the selecteid clones. Friend , 
21, which contributes a substantial segment, cannot infect Lec 8, Thus, outside of the 
■ initial transfection into 293/Gl, there are extremely limited opportunities for the.other 
; Friend parental genomes to be in the same cellular compartment to be copackaged 
together. This is further exacerbated by the progressive loss of infectivities for Lec8 of 

10 the parents with each passage. None of these biological restrictions are applicable to the 
in vitro shuffling process, which can create multi-fragment recombinants in a single • 
reaction. This situation underscores the direct role of the in vitro recombination process 
in generating the recombinants. Thus, shuffling allowed novel recombinants to be created 
which natural mechanisms of retroviral recombination would not be likely to create. 

15 Ishimoto, A. Jounial of the National Cancer Institu^ . 

, was able to adapt Friend MLV strains to infect Syrian Hamster cells at improyed ' 
• efficiencies by continuous passaging. .The starting virus was obtained initially as in vivo 

^ ■ mouse passaged stocks comprising a 'diverse quasispecies' and already had,a low level of 
infectivity for'hamster cells/ Our starting viral stocks, were six defined molecular MLV 

20 clones which had no detectable infectivity for CHO Ki . From these we generated a 
' diverse library of recombinants in one round of shuffling and obtained a recombinant ; 
sequence'that could circumvent the GHOKl entry block. Thus, shufflirig allows high 
diversity to be achieved rapidly from a limited repertoire of starting sequences. The use 
of diverse but related parental sequences (each of which has evolved separately) in family 

25 shuffling enables functional sequence space to be explored efficiently and novel solutions 
.to be rapidly found. Further rounds of shuffle 

improvements. DNA shuffling thus is a useful tool to solve some of the current 
limitations in viral vectors where there are multiple variables and where detailed ^ 
knowledge of the limiting mechanisms is incomplete. 
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/ CellLines \^ -.^ 

Cell lines were obtained from AmerieanTy^ A 

retroviral vector expressing the G4 18 resistance marker (from Gene Therap^^ 

5 Laboratories, University of Southem Galifomia) was introduced into these cells which 

were then subjected to G4 18 selection at O.8-1 mg/ml. About 20-100 resistant colonies 

for each cell type were pooled. These G418 resistant lines are denoted with a 7G1 

suffix. 

Viruses - . . 

10 • Friend MLV (ATCG VR 245) was obtained as a spleen extract containing. 

a mixture of three viruses. An ecotropic 292E strain (ATCC 1 326) was obtained as a 

supernatant from infected NIH 3T3 cells. Genomic DNA from Mus Dunni cells infected 

with these stocks were used to recover proviral sequences of the different MLV strains 

(below). Plasmid pNCA (gift from S. Go ff, Universicy of Columbia) contains a full 

15 length , non-permuted, copy of the, wild type Moloney MLV proviral DNA in a pBR322 

based vector (Colicelli, J. c& Goff, S.P. J. MoL Biol. 199,47-59 (1988)). ; 

Cloning, of Envelope Sequences , , 

* Genomic DNA was isolated from Mus Dunni infected with Friend or the 

292 ecotropic (292 E) MLV strains using the Puregene kit (Gentra Biosystems) and 

20 manufacturer's protocols. Primers were designed to amplify Friend and 292E MLV 

proviral sequences based on the published Moloney MLV sequence (Genbank accession; 

number M76668). The upstream sense primer Mol PolESn straddles the Sfil site in the 

pol gene which is highly conserved between ecotropic MLV strains. The downs^eam 

___^antisense primer,.MolU5 as is p ositio ned at the 3' e n d of the U5 se quenc e. A NotI sit e is 

25 . also included in the 5 * tail of this primer(Fig. 1). PGR was performed using reagents 

from the GeneAmp XL PGR kit (PE Applied Biosystems). Final concentrations of Mg 

acetate, primers and each dNTP were 1.25 mM , 0.5 uM and 200 uM respectively. PGR 

u fragments from the 292E and Friend ampUfications were processed and eventually cloned 

into a modified pNC A (see below) acceptor backbone using the Sfil and NotI unique 

30 sites. PlasmidpNCA was modified by inserting aNot I site just downstream of the 3' 

LTR of the Moloney MLV sequence. A xmique Sfi site exists in the 3' region of the pol 

gene. Cleavage of the modified pNCA plasmid with Not I and Sfi I excises about 0.5 kb 

^ofp"ol7the~entireenvand"3'~LTR7~Theremmning backbone — 
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vector for the cloning of the analogous Friend and 292E MLV fragments. The pNCA ; 
thus provided most of theS' sequence of Moloney MLV while the 3' region of pol and 
the entire env and 3'LTR were replaced with Friend^and 292E sequences. Multiple fiiH- 
length Friend and 292E clones were then screened for infectious actm^^^ 
5 and four Friend clones that produced infectious Virus were chosen as parental sequences. 
The four Friend clones (# 2, 7,9,21) exhibited different relative infectivities on 3T3 and : 
/ Mus Dunni cells. 

Transfectiohs ^ 
, , : ' . Transfections were performed by calcium phosphate precipitation usmg 

D reagents from 5 Prime 3 Prime and manufacturer's protocols. 30 ug of full-length 
. proviral clones or library DNA were transfected into 5 X 106 2'^3/Gl cells (plated one 
day prior to transfectidns) per 100 mm tissue culture dish, fhe precipitate was allowed to 
settle on c^iis for 16- 18 hours before being washed off. Sodium Butyrate. (Sigma) was,, 
added in fresh Dulbecco's Modified Essential Media (Gibco) containing 1 0% fetal bovine 
5 ■ ,serum (FBS; Hyclone) to a final concentr_ation onOmM. After 6- 8 hours exposure, the 
• media was removed and replaced' with 5 ml.of fresh media.. This supernatant was ' 
cqlfected after 20 -24 hours and frozen. Another 5 ml of fresh media was added for a 
further 20 - 24 hours and a 2nd supernatant collection was;performed This was pooled 
, with the first collection, filtered through 0.45 uM . syringe fiUers (Gehnan) and^ for 
50 titrations or passaging. , 

' . , Viral Titrations bv Marker Rescue , , 
■ Cell tyiies used in transfections or passaging of infectious virus contam an 

integrated retroviral Vector expressing the G4 18 resistance marker. Thus we were able to 
monitor the infectious activity of fiiUy replicative MLV by titeririg viral supematants and 
25 selecting for 041 8 resistance. This also allowed us to compare relative efficiencies of 
: . - virus infection on varibiis cell types. Brieflj^ TO fbld^s^ dihitiohspf virus supematants 
in media containing 8ug/ml of Polybrene (Sigma) were allowed to infect cells on 6 well 
plates (plated one day prior at 2-3 X 105 cells /weU). After incubation at 37 C overnight, 
the supernatant was replaced with media containing I mg/ml of G41 8 (Calbiochem, La 

30 JoUa, California). Selection was continued for 7-10 days. Surviving resistant colonies 
were stained by with methylene blue/methanol and counted. 
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Shuffling of Pro viral Sequence and Library Construction 

These six clones were used as templates for PGR amplification to generate 

material for shuffling. MolPolESn and pBRas, an antisense primer in the pBR322 vector 

sequence just downstream of primer MolUSas were used to amplify a specific 3.2 kb 

5 product. PGR products from each of the six parents were purified and mixed together in 

equimolar amounts. This mixture was then digested with DNAse I (Sigma). DNAsed 

fi-agraents in the size range of 0.7 - 1.6 kb were purified and used in the shuffling reaction 

essentially as described before (Grameri, A., Whitehom, E.A., Tate, E. & Stemmer, 

W.P.G. Nature Biotechnology 14, 315-319 (1996)). the completed shuffling reactions 

1 0 were used as templates for preparative PGR using primers MolPolEsn and MolUSas. 
Products fi-om this were purified and digested with NotI and SfiL These fi-agments were 
then cloned into similarly digested modified pNGA acceptor backbone and transformed 
into XL-10 Gold competent cells (Stratagene). Approximately I X 106 colonies were 
obtained and pooled and used to prepare library plasmid DNA. Several independent 

15 colonies were also individually picked and analyzed. Fragments representing the shuffled 
region were amplified from these clones. These PGR fragments were digested 
simultaneusly with Bgl I, Gla I, Dra I, Dra III and Sac 11. The digests were run out on a 
1.5% agarose gel and compared to the restriction patterns of the parents. Glones were 
also assayed for viability. 

20 Librarv Passaging / Selection for GHO Kl Tropic Virus (Fig 2) 

' Library plasmid DNA was transfected into 4 plates of 293/Gl cells as 

described above. 40 ml of supernatant was collected. About 5 ml of this was used for 

titering while 10 ml (polybrene was added to 8 ug/ml) was passaged onto a coculture of 

_ CHOjy_/GI_(90%)jnd 

25 plate. The coculture cells were exposed to this supernatant for 24-48 hours before being 
replaced with fresh F12 Ham (Gibco BRL) media with 10% FBS. When the coculture 
cells had grown to 90-100% confluency, fresh media was added and left on the cells for 
48 hours. This supernatant was collected, filtered and used for titering and for passaging 
onto fresh coculture cells. As a control to account for natural recombination and 

30 adaptation, an equimolar mixture of the six parental clones were transfected, passaged 
and assayed identically to the library supernatant. 
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Sequencing ' ' ,' . 
Fluorescent DNA sequencing of MLV envelopes was performed using an 

ABI Prisin 310 Genetic Analyzer and reagents supplied in the BigDye Terminator Ready 
Reaction kit (ABI Prism). Raw sequence collection software(vl.0.4) and sequencing 
.analysis software(v3.G) was also supplied by ABI Prism. Sequencher v3.0 software 
(Gene Codes Corporation) was used for contig assembly and alignments. 

6: EVnt J ITTON OF HFP ATTTTS C VIRUS (HCV) TO GROW IN TISSUE 

CULTURE . ' ^ . ' 

Hepatitis C Virus (HCV) represents the predominant cause of transfusion 

' associated non-A, non^B hepatitis. Infection is usually chronic with an estimated 4 
million people in the United States and 100 million people worldwide that are chronicaily 
infected with HCV. It has a high propensity to cause chronic hepatitis, which could 
progress to liver cirrhosis and then hepatocellular carcinoma. There is at present .no ' 
vaccine for HCV and therapy with interferon is only 20-30% efficacious. 
, . ■ The chimpanzee repr^sents. the most reliable animal model, although their 

availability is extremely limited and disease in this species is generally mild. 
Development of vaccines and therapeutics has-been hampered by the lack of efficient 
replication of HCV in tissue culture in vitro. Although infection of some human tissue ■ 
.cultiu-e lineshave been reported, there is little evidence -for the ability of stable molecular : 
' clones to carry out complete replication cycles. , The value of evolving such infectious 
molecular clones of HCV that can replicate in estabhshed cell lines will thus be 
significant. It will greatly faciUtateiscreening for anti-viral therapeutics, allow for the 
establishment of neutralizing antibody assays and provide systems for vaccine 
development. 

' ■ The genome of HCV consists of a 9.5 kb positive stranded RNA with a 
single large opOT reading frame (ORF). This fimct^ from 
which all viral proteins are translated. The ORF codes for a large polyprotein which is 
cleaved co- and post-translationally to generate the individual viral proteins. Genomic 
RNA as well as RNA transcripts from full length cDNA clones should be infectious. 
However, are multiple blocks that prevent robust replication in vitro one of which is the 
inefficient translation of HCV transcripts. These blocks are addressed using DNA 
shuffling followed by selection in target cells such as human liver cell lines. These target 
cells include those that naturally express the HCV receptor, e.g.. CD81, as well as those 
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that'either that do not express CD81 or only express it at low levels. To tiihance 
expression of CDS 1 and thus facilitate infection of cells with evolved HCV variants, cell 
lines are optionally stably or transiently transfected with a CDS 1 cDNA expression 
vector. Cells lines that could be used in the screening, after transfection with CDS 1 , 
5 include, but are not limited to, Hela, Cos-1 . Cos-7, CHO, 293, U937, HL60, Jijoye, 
Jurkat, Hep G2, C3A, TF-1 , Baf-3. Methods for stable transfection are known to those 
skilled in the art, and are described for example by van der Merwe et al. (J. Exp. Med. 
185, 393-403, 1997) and Lanier et al (L Immunol., 154, 97405, 1995). ^ . 

Shuffling is performed on the entire genome of HCV or subgenomic 

10 portions or both. The size of the HCV genome is within the range of previous sequences 
that have been successfully shuffled (e.g. adenovirus,: with > 20kb shuffled). 
Furthermore, the genome of HCV is highly heterogeneous with the assignment of at least 
six HCV types encompassing 1 1 subtypes. The most divergent HCV isolates differ from 
each other by more than 30% over the entire genome. Sequence identities lower than this 

15 have been successfully shuffled (e.g. Cephalosporinase). Moreover, HCV, like many 
RNA viruses circulates as a quasispecies, further adding to natural diversity which can be 
harvested for shuffling. 

Protocol for Shuffling and Selection of HCV 

Prepare large quantities of genomic and/or subgenomic fragments of 

20 multiple species of HCV by PCR or by amplification in bacteria. These are obtained as 
full length or partial molecular clones, or from clinical samples. 

DNA shuffling is performed, including e.g., DNAse 1 digestion , PCR 
assembly, (e.g., a long range, high-fideUty PCR.protocol). The PCR can be performed 

- such that a promoter such as-T7 is incorporated at the-5- 

25 or subgenomic) are optionally cloned into a HCV genomic cDN A template with a 

promoter incorporated to reconstitute full length molecular clones. Runoff transcription 
is performed to generate libraries of potentially infectious transcripts. Pools of RNA 
transcripts are transfected into target cells. As noted above, target cells include those 
which express CD81, either naturally, or following transfection with a CD81 coding 
30 nucleic acid. Infectious sequences are recovered by PCR^ e.g., from virions or negative 
strain (replicated) RNA by RT-PCR. It is also possible to enrich or select for replicating 
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infectious virus by passaging infected cells or supematants on target cells. Shuffling and 
selection steps are repeated as desired. 

pvAMPTT^7: TOP nnw AND BOT TOM TTP SHTIFFtTNG OF HTV-1 TO CHANGE 

V TRAT.TROPISM i T^xTA u «!• 

^ll^ principal purpose of this example is to apply DNA Shuffling 

technology, or molecular breeding, to the problem of evolving a variant of HIV- 1 that can 
replicate in mouse cells and in transgenic mice expressing h^^ There is 

wide consensus in the field that there is a large unmet need for a small animal model for 
AIDS in which to test the many emerging sniall molecule therapeutics that have been 
discovered recently. Additionally, there is an enormous unmet need for basic research on 
HIV- 1 vaccines. The HIV-l variants that are evolved to replicate in hCD4+ hCCR5+ 
double transgenic mice add valuable new tools to meet these argent biopharraaceutical • 

■ and human health needs. • ■ . 

An additional goal of this exiunple i s to evolve HIV to replicate in 
macaque cells by methods similar to those being used tp evolve the virus to repHcate in . 
mouse cells. The motivation for this approach is that there are known to be fewer blocks . 
to HIV replication.in macaque cells (for example, transfected molecular clones produce 
infectious virus), and hence this is easier to achieve. Additipnally, the physiology and 
immune systms of macaques are more similar to the corresponding human sys 

are the murine systems, making macaques a good vaccine model; 

. The recent discoveries of the HiV coreceptors CXCR4 and GCR5 have 

removed the barriers to infection of murine cells by HIV- 1 . However, there remain 
blocks to HIV-l replication in murine cells. There is not sufficient diversity in available . 
HIV-l isolates to directly select a mutant virus that can replicate on the murine target 
cells (Harris Goldstein ahd'colleagues. PNAS 94:14637.14641, 1997)'. Consequently, a 
variant capable of serving as a mouse modelfor AIDS needs mu^^^ 
overcome these blocks. DNA shuffling is a method for improving single and multi-gene 
traits which require many mutations and this method has generated improvements ranging 
from 10 to 32,000-fold. Neither a pnor/ assumptions nor knowledge of the gene 
structures are required for successful application of gene shuffling. Thus, the technology 
is adaptable to the problem of adapting HIV-lto replicate in non-human cells. 
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If a virus is evolved or engineered to replicate in murine cells, it will have 
many mutations relative to wild type HIV-1 which may be unnecessary for replication in 
murine cells and which will compromise it as a valid model for AIDS. DNA shuffling 
provides a solution to this problem because one can backcross a mutant of interest with 
5 wild type strains. This natural feature of shuffling technology is used to perfonn in vitro 
backcrosses of evolved variants with wild type HTV-l strains of commercial interest. 
This step will ensure that only those mutations necessary for viral propagation in the ^ 
mouse are preserved, thereby optimizing the predictive value of this laboratory model for 
the human disease. These evolved viruses will be used in conjunction with the double 
1 0 . transgenic mice to identify novel small molecule drugs and prophylactic and treatment 
vaccines. 

The experimental strategy is schematized in Figure 9. HIV-1 is adapted to 
grow in murine tissue culture cells using both "top do\VTi" and "bottom up" approaches. 
These mutants are further evolved to replicate in hCD4-f , hCCR5-r double transgenic 
15 mice, and to cause pathogenesis. These mutant HIV-1 isolates are backcrossed to wild 
type HIV-1 isolates to obtain a virus that can replicate in the transgenic model while 
being maximally similar to wild type human HIV-1 isolates. Figure 9 schematizes the 
strategic choice tree that used to prioritize objectives and to decide when to move on to 
subsequent modules of Hiy shuffling and design. 

20 Top down approach 

In the top down approach, a mutant virus is identified that can replicate, 

however weakly, on hGD4+ hCCR5+ murine cells. This is done by testing existing HIV- 
_ Aiio^^l'^s and by constructing Ubjaries ofjwvel^I^^ 

shuffling. Initial selection is performed in tissue culture cells. Weakly replicating 
25 viruses serve as starting points for further evolution. To increase the efficiency of 

selecting a mutant virus that can be propagated in murine cells, DNA shuffling is used to . 

recombine the diversity that exists in the natural HIV population. Libraries of novel 

recombinants are generated containing mutants that are capable of replicating in the 

hCD4+ hCCR5+ murine target cells. Viral replication is quantitated by measuring p24 
30 production and viral reverse transcriptase activity. The goal is to evolve a virus that 

yields a tissue culture infectious dose-50 (TCID-50) of 1-10% the level produced by wild 
type-HIV- l-on-human cells- This approach initially-yields- weakly replica 
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Weakly replicating vims obtained directly by. DNA shuffling of ti^^ 
further evolved using recursive application of DNA- shufflm^ 

the target murine cells. . " 

' Objectives for the "top down" approach are as follows. First, methods for 

shuffling fflV4 are established -and efficiently reconstructing i^^ 
(i e. library sizes of >.10*); Shuffling of a 10 kb genome is well within the capability of 
standard DNA shuffling. Second,> mutant virus is obtained that demonsttates : 
measurable ability to replicate on murine cells, as defmed by the ability to sen . , 

.. transfertheability toprbduceviralantigensorviralR^ 
applied to improve on this activity. Viriises with weak rephcation activity are subjected 

to recursive DNA shuffling and selection in vitro in tissue culture cells, with the goal of 
-;evolVingaviraltiterthatis0.1%- 10%otthatofwildtypeffl^^^^^ These 
evolved viruses are used for further selection experiments in double transgenic mice. 

R nttom uo approach: evp liition of viral components 

The top down approach relies on whole HIV-1 genome shuffling to create 

■ initial weakly replicating vinises. tWs is technically chaUenging^b^^^^^ 
multiple blocks to replication. The following, approach also litilizes DNA shuffling, but ; 
takes a "bottom up" approach to the problem... The overall-goal is broken into^^^ 
objectives/providing an incremental pathway toward the adap^^^ 
3 transgenic mice. First.' the HIV- 1 LTR. tat and tai: Revolved for efficient function in' 
. ■ mousecells. .Second..the inutants are shuffled into Hiy-1 With a reporter 

inserted at the 3' end of the genome and further evolve the reporter virus to efficiently 
■ express full length transcripts. Third, mutants are evolved to, replicate in mouse tissue 
• culture cells and produce high titer viral stocks. Fourth, if further blocks remain after • 
15 :evolvingtat,thesearebiochemicallyandgene^^^^^ 
"^4esigned with which to evolve toecb^^ and vpr genes are examples of ■ 

individual genes that are optimized for function in mouse cells. 

These goals are chosen to define an incremental pathway whereby DNA 
shuffling is used at each step' to overcome blocks in viral replication, and in the end solve 
30 a highly complex problem that may require many mutations and hence is not readily 

accessible by any competing approach. The quantitative objectives for this second 
'approach are as follows. First, a 10 - 100 fold increase in mouse cell expression is 
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obtained for GFP reporter construct driven by tat, tar and the HIV LTR. Second, mouse 
cell expression of a GFP reporter gene encoded at the 3' end of the HIV-1 LTR is 
obtained that is 1 0 - 50% of the level expressed.by a wild type HIV-1 GFP reporter virus 
integrated into human cells. Third, viral titers of 1 - 50% of wild type HIV-1, as 
5 qiiantitated by p24 concentration or quantitative RT-PCR measurements of viral RNA in 
the supernatant in standard spreading infection assays is obtained. The kinetics of growth 
are measured with these assays to demonstrate that infectious material exists. Fourth, - 
other replication blocks are characterized as necessary. 

Two recent publications affect the strategy herein. Jones and colleagues 

10 have recently reported the cloning of a human transcriptional elongation factor that 
interacts with tat (Cell 92:45 1 -462, Feb. 20, 1 998; A Novel CDK9.Associated.C-Type 
Cyclin Interacts Directly with HIV-1 Tat and Mediates Its High- Affinity, Loop-Specific 
Binding to TAR RNA). The results of this work, presented at the March 1998 Keystone 
Symposium, showed that human Cyclin T interacts directly with tat in activating poin for 

15 elongation of messages driven by the HIV LTPv. Jones transfected this gene into mouse 
cells and showed an increase in tat in^^^^oibie gene expression. Introduction of this human 
gene into transgenic mice relieves one of the blocks to HIV repHcation. 

The use of SCID-Hu mice for studying protease and RT inhibitors in vivo 
has been reported on (J. Infec. Dis. 177:337-346, 1998). HIV can replicate in this system 

20 and known RT and protease inhibitors inhibit replication. The broad use of SCID-Hu 

mice for drug studies is limited by the high cost of producing these mice which have to be 
individually repopulated with fetal human cells. Additionally, one will not be able to 

" make use of genetic manipulation of the murine immune system,- such as CD4 and GD8 
HockoutsTinrthif ^^^^ utility of a mouse modelrfor studying- 

25 HIV. The approach herein has the potential to overcome the limitations of this model. 

Evolution of whole virus 

In one embodiment, the following steps are used to evolve HIV for 
replication in non-human cells. First, cloning vectors and protocols for shuffling 
infectious molecular clones in two non-infectious pieces are established. Second, 
30 methods for efficiently making large (>10^ complexity) libraries of infectious molecules 
from shuffled fragments ofHIV-1 are established. Third; libraries of HIV-1 
recombinants are produced using in vivo recombinatiori pathways. Foxuth, synthetic 
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HTV-l tat genes representing all clades are bred into infectious virus and .make large . 
libraries of infectious molecular clones, based on the techniques established in the first 
and second step. Fifth, CD4+ CCR5+ naouse cells are infected with these libraries and 
selected for mutants which can replicate and be passaged in tissue culture. Eight 
5 consensus tat genes representing all eight HIV-1 clades were synthesized. Sixth, mouse 
. and human reporter cells: containing constructs with GFP under the control of an HIV-1 . 
LTR were constructed. Seventh, the transcriptional activation.activities of the synthetic 
tat genes in the reporter cells are validated. Ninth, libraries of shuffled tat genes are 
constructed in an MLV retroviral vector and introduce them into the reporter cells. Tenth, . 

1 0 methods to F ACS select libraries for cells with tat induced GFP are established, and the . 
tat genes .recovered by PGR, shuffled/and reconstruct new libraries of shuffled genes. 
Eleventh, evolved tat genes, are bred into the context of infectious molecular clones and 
select for growth, on mouse cells. For Macaques, an immortalized macaque cellline is 
constructed expressing macaque CD4 and CGR5, which' are known to support efficient 

15 entry of HIV into macaque cells. This cell line is used to select in tissue culture for 
replication ofHIV in macaque cells. . ; ' 

■ Infectious HIV-1 libraries carrying shuffled gag-proteinase coding 

sequences were generated. 2.5kb fragments encoding gag-pro tcinase coding region were 
amplified from four clade B isolates and four clade D isolates. After DNasel digestion, 
. 20 500-1000 bp and 1000-1500 bp fragments were purified by agarose gel electrophoresis. 
Assembly reaction was performed using clade B fragments only, clade D fragments only, 
or a mixture of clade B and D. Because all three reactions yielded similar amounts of 
assembled products, clade B/D assembled products were used to maximized diversity of 
/ libraries. The number pfthe assetnbled molecules in the assembly reaction was estima 
25 by semi-quantitative PGR. Serial lOrfold dilutions of the assembled products (from 500- 
1000 bp md 1000-1 500 bp fragments, indicated as S and L, irespectively) md the standard 
: (plasmid carrying proviral DNA) were made and amplified by PGR. Approximately 10^^ 
moleciiles were detected in both assembly reactions. The one assembled from the smaller 
(500-1 OOObp) fragments was used to maximize the frequency of recombination. 
3Q If one can assume all chimeric molecules generated by the above shuffling 

experiment are different, the maximum library generated from this assembled product is 
10*^. However, there is no advantage to make libraries larger than the screening capacity. 
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Th5 library for is screened for replication in target cells by: Transfection of 
the library into human 293 cells, Coculture transfected cells with the target cells (monkey 
lymphocytes), Separate target cells from 293 cells, culture for 2 weeks, and passage the 
replicating virus. Because the amount of DNA used for transfection is limited (typically 
5 30|ag), the minimum amount of DNA required for producing one infectious virus 

determines the appropriate library size that can be analyzed in one transfection/infection. 
To determine the minimum infectious DNA dose, serial 10-fold dilution of the wild type 
HfV-1 DNA was used for transfection of human 293 cells in quintuplicate. Transfected 
cells were cocultured with human lymphoid cell line MT-4, to amplify infectious virus 

10 produced from transfected cells. Cultures were kept for 3 weeks to detect the end-point 
of infectivity. Approximately 10 ng DNA was required to produce one infectious virus. 
Therefore, libraries containing 3000 clones (30 |ig divided by 10 ng) are adequate. 

Based on this result, we decided to generate multiple sublibraries from the 
same assembly reaction. We made multiple aliquots from the assembly reaction, each of 

15 them containing 10^ molecules. Presumably, all chimeric molecules from aliquot#l 
should be different from any chimeric molecule in aliquot#2 (no redundancy). Each 
aliquot was ampHfied by PGR and cloned into the full-length HIV, Because cloning 
efficiency is not high, sizes of sublibraries are not 10^ but range from 5,000 to 100,000. 
These are large .enough because there is no advantage to make libraries larger than 3,000 

20 (see the previous paragraph). 

We next examined viability and diversity of one of the sublibraries. Six 
out of 40 randomly chosen clones were able to replicate in human MT-4 cells. When 
these clones were analyzed by Dral digestion, 13 clones exhibited patterns different from 
any one of the parental clones. Because Dral restriction digestion does not distinguish all 

25 parental clones (e.g. ELI, UGl 5, and Z2Z6 have the same restriction pattern therefore 
these three and chimeras between them are indistinguishable), recombination rate of 
13/40 is very likely to be underestimation. This library has enough viability and diversity 
for screening. 

Modifications can be made to the methods and compositions as herein 
30 before described without departing from the spirit or scope of the invention as claimed, 

and the invention can be put to a number of different uses. Assays kits or systems 
providin g a use of an y one of the components , methods or substrates herein before 
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described are also provided. Kits will optionally additionally comprise instructions for 
jjerforming assays or using the methods or compositions herein, packaging materials, one 
or more cont^ers which contain cpnipositions, system components, or the lik^^ 
further aspect, the present invention provides for the use bf any method, composition or, . 
5 kit herein, for the practice of any method or assay herein, and/bir for the use of any , „ 
composition, method or kit to practice any assay or method herein. ^ 

- It is imderstood that the examples and embodiments described herein a^^^ - 
for illustrative purposes only and that various modifications or changes ih light thereof 
1 0 • will be suggested to persons skilled in the art and are .to be included within the spirit and 
piirview of this application and scope of the appended claims. All publications, patents, 
arid patent, applications cited herein are hereby incorporated by reference for' all purposes, 
ks if each reference were specifically indicated to be i;v;pFppra^^^^ 
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WHAT IS CLAIMED IS: / 

1 1 . A method for generating a viral polynucleotide sequence having a genotype encoding 

2. at least one modified viral phenotype, the method comprising: 

3 contacting a cell or non-hiiman animal which does not naturally support 

4 substantial replication of an predetermined vims, with at least one initial infectious virion 

5 or replicable genome of said predetermined virus under rephcation conditions; 

6 , recovering a plurality of replicated genome copies of said predetennined 

7 virus, either as virions. or as viral genomes in polynucleotide form, wherein some;or all of 

8 the replicated genome copies comprise a mutation relative to the initial infectious virion 

9 or replicable genome; 

10 ' recombining a pluraHty of said replicated genome copies, so as to shuffle 

1 1 the mutations, thereby generating a collection of recombiried replicated genome copies; . 

12 and, . % 

43 . selecting or screening said colleciion of recombined replicated genome 



14' copies to obtain one or more replicable viral genome encoding at least one modified viral ' 

15 phenotype. v ' - ' 

1 2. The method of claim 1, wherein the modified viral phenotype is a host range or cell 

2 tropism phenotype. " 

1 3.. The method of claim 2, wherein ^the host range or cell tropism phenotype is the ability 

2 to replicate in mouse or macaque cells. . ; , * - 

1 4. The method of claim 2, wherein the host range or cell tropism phenotype is the ability 

2 ' to replicate in a transgenic mouse expressing a human CD4 protein or HIV co-receptor on 

3 lymphocytes. ; . v 

1 5. The method ofclaim l, wherein the predetemined virus is selected from HrV-l, H^ 

2 . 2, HCV, HBV and MLV. 

1 6. The method of claim 5, wherein the virus is an HIV-1 which HIV-1 is a clinical isolate 

2 which has been passaged in cell culture for less than 10 passages. 
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1 7 The method of claim 1, wherein the modified viral phenotype is an ability to replicate 

2 , in a tr^sgenic non-himian aiiimal expressing human CD4 and human CCR5. 

1 ; 8; the method of claim t wherein the transgenic non-human animal als^ • 

2 human CXCR4. 

1 9. The method of claim'l, comprising the further step of recombining a plurality of 

2 species of viral genomes of said predetermined virus, so as to shtiffle the viral genome 

3 ; sequences, thereby generating a collection oif recombined replicated genome copies, prior 

4 . to contacting the viral genome with a a cell or non-human animal which does not 

5 nahirally support' substsmtialteplicaiioh- of s'aid predeieraiined. yirns. 

1 10. The method of claim 1 , which further compn'ses a.;? iterative rscvcie of at least one 
2, interation. ' . . ' •. ■: ■. • 

! • 11. The method of claim 9, wh'erein' the steps of shuifling and selection are iteratively 

2 repeated at least twice. 

1 12. A recombinant virus capable of replicating in a cell or organisni which .is;non- 

2 permissible for replication of a'wild-type virus, and made by.the method of claim 1. 

1 , 13. The recombinant virus of claim 12, wherein the genome comprises a plurality of 

2 genome segments; wherein at least three of said genome segments are derived from 

3 ndnidentical species of viral genomes. - ^ 

1 14; The recombinant virus of claim 12, wherein the recombinant yiius has a viral genome 

2 composed of an HIV-1 genome comprising viral genome sequences from at least two 

3 clades ofHIV-1. ' 

1 15. The recombinant virus of claim 12, wherein the recombinant virus is a SHIV virus. 
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1 16. The recombinant virus of claim 1 2, wherein the cell or organism is a transgenic 

2 mouse cell or a transgenic mouse, said transgenic cell or transgenic mouse harboring ari 

3 expressible transgene encoding human CD4. 

1 1 7. The recombinant virus of claim 1 6, wherein the transgenic cell or transgenic mouse 

2 further harbors an expressible transgene that encodes human CCR5. 

1 1 8. A selected, shuffled virus having a genotype encoding at least one modified viral 

•2 phenotype. 

1 19. The selected, shuffled virus of claim 18, wherein said selected shuffled virus is an 

2 HIV-1 virus or a SHIV virus and repHcates in a mouse cell. 

1 20, The selected, shuffled virus of claim 19, wherein the mouse cell expresses human - 

2 CD4 and human CCR5 encoded on a transgene or expression vector. 
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